Warning: Permanently added '2620:52:3:1:dead:beef:cafe:c15b' (ED25519) to the list of known hosts. You can reproduce this build on your computer by running: sudo dnf install copr-rpmbuild /usr/bin/copr-rpmbuild --verbose --drop-resultdir --task-url https://copr.fedorainfracloud.org/backend/get-build-task/9179453-fedora-rawhide-x86_64 --chroot fedora-rawhide-x86_64 Version: 1.3 PID: 2674 Logging PID: 2675 Task: {'allow_user_ssh': False, 'appstream': False, 'background': True, 'build_id': 9179453, 'buildroot_pkgs': [], 'chroot': 'fedora-rawhide-x86_64', 'enable_net': False, 'fedora_review': False, 'git_hash': '147ddd9d6216916e2ceda6ec87484f7f5c852d5f', 'git_repo': 'https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/rccl', 'isolation': 'default', 'memory_reqs': 2048, 'package_name': 'rccl', 'package_version': '6.4.1-3', 'project_dirname': 'RH', 'project_name': 'RH', 'project_owner': '@rocm-packagers-sig', 'repo_priority': None, 'repos': [{'baseurl': 'https://download.copr.fedorainfracloud.org/results/@rocm-packagers-sig/RH/fedora-rawhide-x86_64/', 'id': 'copr_base', 'name': 'Copr repository', 'priority': None}], 'sandbox': '@rocm-packagers-sig/RH--https://src.fedoraproject.org/user/trix', 'source_json': {}, 'source_type': None, 'ssh_public_keys': None, 'storage': 0, 'submitter': 'https://src.fedoraproject.org/user/trix', 'tags': [], 'task_id': '9179453-fedora-rawhide-x86_64', 'timeout': 18000, 'uses_devel_repo': False, 'with_opts': [], 'without_opts': []} Running: git clone https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/rccl /var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl --depth 500 --no-single-branch --recursive cmd: ['git', 'clone', 'https://copr-dist-git.fedorainfracloud.org/git/@rocm-packagers-sig/RH/rccl', '/var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl', '--depth', '500', '--no-single-branch', '--recursive'] cwd: . rc: 0 stdout: stderr: Cloning into '/var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl'... Running: git checkout 147ddd9d6216916e2ceda6ec87484f7f5c852d5f -- cmd: ['git', 'checkout', '147ddd9d6216916e2ceda6ec87484f7f5c852d5f', '--'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl rc: 0 stdout: stderr: Note: switching to '147ddd9d6216916e2ceda6ec87484f7f5c852d5f'. You are in 'detached HEAD' state. You can look around, make experimental changes and commit them, and you can discard any commits you make in this state without impacting any branches by switching back to a branch. If you want to create a new branch to retain commits you create, you may do so (now or later) by using -c with the switch command. Example: git switch -c Or undo this operation with: git switch - Turn off this advice by setting config variable advice.detachedHead to false HEAD is now at 147ddd9 automatic import of rccl Running: dist-git-client sources cmd: ['dist-git-client', 'sources'] cwd: /var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl rc: 0 stdout: stderr: INFO: Reading stdout from command: git rev-parse --abbrev-ref HEAD INFO: Reading stdout from command: git rev-parse HEAD INFO: Reading sources specification file: sources INFO: Downloading RCCL-6.4.1.tar.gz INFO: Reading stdout from command: curl --help all INFO: Calling: curl -H Pragma: -o RCCL-6.4.1.tar.gz --location --connect-timeout 60 --retry 3 --retry-delay 10 --remote-time --show-error --fail --retry-all-errors https://copr-dist-git.fedorainfracloud.org/repo/pkgs/@rocm-packagers-sig/RH/rccl/RCCL-6.4.1.tar.gz/md5/d23391d405d5d3454400b9c29d986b12/RCCL-6.4.1.tar.gz % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed tail: /var/lib/copr-rpmbuild/main.log: file truncated 100 1848k 100 1848k 0 0 18.6M 0 --:--:-- --:--:-- --:--:-- 18.8M INFO: Reading stdout from command: md5sum RCCL-6.4.1.tar.gz Running (timeout=18000): unbuffer mock --spec /var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1750253281.853958 -r /var/lib/copr-rpmbuild/results/configs/child.cfg INFO: mock.py version 6.2 starting (python version = 3.13.3, NVR = mock-6.2-1.fc42), args: /usr/libexec/mock/mock --spec /var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl/rccl.spec --sources /var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl --resultdir /var/lib/copr-rpmbuild/results --uniqueext 1750253281.853958 -r /var/lib/copr-rpmbuild/results/configs/child.cfg Start(bootstrap): init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish(bootstrap): init plugins Start: init plugins INFO: tmpfs initialized INFO: selinux enabled INFO: chroot_scan: initialized INFO: compress_logs: initialized Finish: init plugins INFO: Signal handler active Start: run INFO: Start(/var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl/rccl.spec) Config(fedora-rawhide-x86_64) Start: clean chroot Finish: clean chroot Mock Version: 6.2 INFO: Mock Version: 6.2 Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1750253281.853958/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata INFO: Guessed host environment type: unknown INFO: Using container image: registry.fedoraproject.org/fedora:rawhide INFO: Pulling image: registry.fedoraproject.org/fedora:rawhide INFO: Tagging container image as mock-bootstrap-8a77af3b-2ae9-4cef-a41d-cec70b65b07b INFO: Checking that 7c2f4c973ff5ff47aab0ebeb8b131281c5af6772a4d80dd52035f0bd6eb22733 image matches host's architecture INFO: Copy content of container 7c2f4c973ff5ff47aab0ebeb8b131281c5af6772a4d80dd52035f0bd6eb22733 to /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1750253281.853958/root INFO: mounting 7c2f4c973ff5ff47aab0ebeb8b131281c5af6772a4d80dd52035f0bd6eb22733 with podman image mount INFO: image 7c2f4c973ff5ff47aab0ebeb8b131281c5af6772a4d80dd52035f0bd6eb22733 as /var/lib/containers/storage/overlay/df32d10c34c7e3ab783f197368d7550eb87481296646b2e2065a3104872be5c7/merged INFO: umounting image 7c2f4c973ff5ff47aab0ebeb8b131281c5af6772a4d80dd52035f0bd6eb22733 (/var/lib/containers/storage/overlay/df32d10c34c7e3ab783f197368d7550eb87481296646b2e2065a3104872be5c7/merged) with podman image umount INFO: Removing image mock-bootstrap-8a77af3b-2ae9-4cef-a41d-cec70b65b07b INFO: Package manager dnf5 detected and used (fallback) INFO: Not updating bootstrap chroot, bootstrap_image_ready=True Start(bootstrap): creating root cache Finish(bootstrap): creating root cache Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-1750253281.853958/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Package manager dnf5 detected and used (direct choice) INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-5.99.90-6.fc43.x86_64 rpm-sequoia-1.8.0-1.fc43.x86_64 dnf5-5.2.13.1-3.fc43.x86_64 dnf5-plugins-5.2.13.1-3.fc43.x86_64 Start: installing minimal buildroot with dnf5 Updating and loading repositories: fedora 100% | 15.6 MiB/s | 21.8 MiB | 00m01s Copr repository 100% | 1.2 MiB/s | 215.4 KiB | 00m00s Repositories loaded. Package Arch Version Repository Size Installing group/module packages: bash x86_64 5.2.37-3.fc43 fedora 8.2 MiB bzip2 x86_64 1.0.8-20.fc42 fedora 99.3 KiB coreutils x86_64 9.7-3.fc43 fedora 5.4 MiB cpio x86_64 2.15-2.fc41 fedora 1.1 MiB diffutils x86_64 3.12-2.fc43 fedora 1.6 MiB fedora-release-common noarch 43-0.16 fedora 20.4 KiB findutils x86_64 1:4.10.0-5.fc42 fedora 1.9 MiB gawk x86_64 5.3.2-1.fc43 fedora 1.8 MiB glibc-minimal-langpack x86_64 2.41.9000-15.fc43 fedora 0.0 B grep x86_64 3.12-1.fc43 fedora 1.0 MiB gzip x86_64 1.13-3.fc42 fedora 392.9 KiB info x86_64 7.2-3.fc42 fedora 357.9 KiB patch x86_64 2.8-1.fc43 fedora 226.8 KiB redhat-rpm-config noarch 343-6.fc43 fedora 181.4 KiB rpm-build x86_64 5.99.90-6.fc43 fedora 281.7 KiB sed x86_64 4.9-4.fc42 fedora 857.3 KiB shadow-utils x86_64 2:4.17.4-1.fc43 fedora 4.0 MiB tar x86_64 2:1.35-5.fc42 fedora 3.0 MiB unzip x86_64 6.0-66.fc42 fedora 390.3 KiB util-linux x86_64 2.40.4-8.fc43 fedora 3.4 MiB which x86_64 2.23-1.fc42 fedora 83.4 KiB xz x86_64 1:5.8.1-1.fc43 fedora 1.3 MiB Installing dependencies: add-determinism x86_64 0.6.0-1.fc43 fedora 2.5 MiB alternatives x86_64 1.33-1.fc43 fedora 62.2 KiB ansible-srpm-macros noarch 1-17.1.fc42 fedora 35.7 KiB audit-libs x86_64 4.0.5-1.fc43 fedora 351.3 KiB binutils x86_64 2.44-3.fc43 fedora 25.9 MiB build-reproducibility-srpm-macros noarch 0.6.0-1.fc43 fedora 735.0 B bzip2-libs x86_64 1.0.8-20.fc42 fedora 84.6 KiB ca-certificates noarch 2024.2.69_v8.0.401-5.fc42 fedora 2.6 MiB coreutils-common x86_64 9.7-3.fc43 fedora 11.3 MiB crypto-policies noarch 20250603-1.git3a584b3.fc43 fedora 142.2 KiB curl x86_64 8.14.1-1.fc43 fedora 474.1 KiB cyrus-sasl-lib x86_64 2.1.28-30.fc42 fedora 2.3 MiB debugedit x86_64 5.1-6.fc43 fedora 192.7 KiB dwz x86_64 0.16-1.fc43 fedora 287.1 KiB ed x86_64 1.21-2.fc42 fedora 146.5 KiB efi-srpm-macros noarch 6-3.fc43 fedora 40.1 KiB elfutils x86_64 0.193-2.fc43 fedora 2.9 MiB elfutils-debuginfod-client x86_64 0.193-2.fc43 fedora 83.9 KiB elfutils-default-yama-scope noarch 0.193-2.fc43 fedora 1.8 KiB elfutils-libelf x86_64 0.193-2.fc43 fedora 1.2 MiB elfutils-libs x86_64 0.193-2.fc43 fedora 683.4 KiB fedora-gpg-keys noarch 43-0.2 fedora 129.0 KiB fedora-release noarch 43-0.16 fedora 0.0 B fedora-release-identity-basic noarch 43-0.16 fedora 664.0 B fedora-repos noarch 43-0.2 fedora 4.9 KiB fedora-repos-rawhide noarch 43-0.2 fedora 2.2 KiB file x86_64 5.46-5.fc43 fedora 100.2 KiB file-libs x86_64 5.46-5.fc43 fedora 11.9 MiB filesystem x86_64 3.18-44.fc43 fedora 112.0 B filesystem-srpm-macros noarch 3.18-44.fc43 fedora 38.2 KiB fonts-srpm-macros noarch 1:2.0.5-22.fc43 fedora 55.8 KiB forge-srpm-macros noarch 0.4.0-2.fc42 fedora 38.9 KiB fpc-srpm-macros noarch 1.3-14.fc42 fedora 144.0 B gdb-minimal x86_64 16.3-3.fc43 fedora 13.2 MiB gdbm-libs x86_64 1:1.23-9.fc42 fedora 129.9 KiB ghc-srpm-macros noarch 1.9.2-2.fc42 fedora 779.0 B glibc x86_64 2.41.9000-15.fc43 fedora 6.7 MiB glibc-common x86_64 2.41.9000-15.fc43 fedora 1.0 MiB glibc-gconv-extra x86_64 2.41.9000-15.fc43 fedora 7.2 MiB gmp x86_64 1:6.3.0-3.fc43 fedora 819.2 KiB gnat-srpm-macros noarch 6-7.fc42 fedora 1.0 KiB gnupg2 x86_64 2.4.8-2.fc43 fedora 6.5 MiB gnupg2-dirmngr x86_64 2.4.8-2.fc43 fedora 618.4 KiB gnupg2-gpg-agent x86_64 2.4.8-2.fc43 fedora 671.4 KiB gnupg2-gpgconf x86_64 2.4.8-2.fc43 fedora 250.0 KiB gnupg2-keyboxd x86_64 2.4.8-2.fc43 fedora 201.4 KiB gnupg2-verify x86_64 2.4.8-2.fc43 fedora 348.5 KiB gnutls x86_64 3.8.9-5.fc43 fedora 3.6 MiB go-srpm-macros noarch 3.6.0-7.fc43 fedora 60.8 KiB gpgverify noarch 2.1-3.fc43 fedora 8.7 KiB ima-evm-utils-libs x86_64 1.6.2-5.fc43 fedora 60.7 KiB jansson x86_64 2.14-2.fc42 fedora 93.1 KiB java-srpm-macros noarch 1-4.fc43 fedora 894.0 B json-c x86_64 0.18-2.fc42 fedora 86.7 KiB kernel-srpm-macros noarch 1.0-25.fc42 fedora 1.9 KiB keyutils-libs x86_64 1.6.3-5.fc42 fedora 58.3 KiB krb5-libs x86_64 1.21.3-6.fc43 fedora 2.3 MiB libacl x86_64 2.3.2-3.fc42 fedora 38.3 KiB libarchive x86_64 3.8.1-1.fc43 fedora 951.1 KiB libassuan x86_64 2.5.7-3.fc42 fedora 167.8 KiB libattr x86_64 2.5.2-5.fc42 fedora 27.1 KiB libblkid x86_64 2.40.4-8.fc43 fedora 262.4 KiB libbrotli x86_64 1.1.0-7.fc43 fedora 833.3 KiB libcap x86_64 2.76-1.fc43 fedora 209.2 KiB libcap-ng x86_64 0.8.5-5.fc43 fedora 68.9 KiB libcom_err x86_64 1.47.2-3.fc42 fedora 67.1 KiB libcurl x86_64 8.14.1-1.fc43 fedora 895.2 KiB libeconf x86_64 0.7.9-1.fc43 fedora 64.9 KiB libevent x86_64 2.1.12-15.fc42 fedora 903.1 KiB libfdisk x86_64 2.40.4-8.fc43 fedora 372.3 KiB libffi x86_64 3.5.1-1.fc43 fedora 83.6 KiB libfsverity x86_64 1.6-2.fc42 fedora 32.5 KiB libgcc x86_64 15.1.1-2.fc43 copr_base 266.6 KiB libgcrypt x86_64 1.11.1-1.fc43 fedora 1.6 MiB libgomp x86_64 15.1.1-2.fc43 copr_base 539.1 KiB libgpg-error x86_64 1.55-1.fc43 fedora 915.3 KiB libidn2 x86_64 2.3.8-1.fc43 fedora 552.5 KiB libksba x86_64 1.6.7-3.fc42 fedora 402.5 KiB libmount x86_64 2.40.4-8.fc43 fedora 356.3 KiB libnghttp2 x86_64 1.65.0-1.fc43 fedora 162.2 KiB libpkgconf x86_64 2.3.0-2.fc42 fedora 78.1 KiB libpsl x86_64 0.21.5-5.fc42 fedora 76.4 KiB libselinux x86_64 3.8-3.fc43 fedora 193.1 KiB libsemanage x86_64 3.8.1-3.fc43 fedora 304.4 KiB libsepol x86_64 3.8-1.fc42 fedora 826.0 KiB libsmartcols x86_64 2.40.4-8.fc43 fedora 176.4 KiB libssh x86_64 0.11.1-4.fc42 fedora 565.5 KiB libssh-config noarch 0.11.1-4.fc42 fedora 277.0 B libstdc++ x86_64 15.1.1-2.fc43 copr_base 2.8 MiB libtasn1 x86_64 4.20.0-1.fc43 fedora 176.3 KiB libtool-ltdl x86_64 2.5.4-4.fc42 fedora 70.1 KiB libunistring x86_64 1.1-9.fc42 fedora 1.7 MiB libusb1 x86_64 1.0.28-2.fc43 fedora 171.0 KiB libuuid x86_64 2.40.4-8.fc43 fedora 37.3 KiB libverto x86_64 0.3.2-10.fc42 fedora 25.4 KiB libxcrypt x86_64 4.4.38-7.fc43 fedora 284.5 KiB libxml2 x86_64 2.12.10-2.fc43 fedora 1.7 MiB libzstd x86_64 1.5.7-1.fc43 fedora 807.8 KiB lua-libs x86_64 5.4.8-1.fc43 fedora 280.8 KiB lua-srpm-macros noarch 1-15.fc42 fedora 1.3 KiB lz4-libs x86_64 1.10.0-2.fc42 fedora 157.4 KiB mpfr x86_64 4.2.2-1.fc43 fedora 828.8 KiB ncurses-base noarch 6.5-5.20250125.fc42 fedora 326.8 KiB ncurses-libs x86_64 6.5-5.20250125.fc42 fedora 946.3 KiB nettle x86_64 3.10.1-1.fc43 fedora 790.5 KiB npth x86_64 1.8-2.fc42 fedora 49.6 KiB ocaml-srpm-macros noarch 10-4.fc42 fedora 1.9 KiB openblas-srpm-macros noarch 2-19.fc42 fedora 112.0 B openldap x86_64 2.6.10-1.fc43 fedora 655.8 KiB openssl-libs x86_64 1:3.5.0-5.fc43 fedora 8.9 MiB p11-kit x86_64 0.25.5-8.fc43 fedora 2.2 MiB p11-kit-trust x86_64 0.25.5-8.fc43 fedora 395.5 KiB package-notes-srpm-macros noarch 0.5-13.fc42 fedora 1.6 KiB pam-libs x86_64 1.7.0-4.fc42 fedora 126.7 KiB pcre2 x86_64 10.45-1.fc43 fedora 697.7 KiB pcre2-syntax noarch 10.45-1.fc43 fedora 273.9 KiB perl-srpm-macros noarch 1-57.fc42 fedora 861.0 B pkgconf x86_64 2.3.0-2.fc42 fedora 88.5 KiB pkgconf-m4 noarch 2.3.0-2.fc42 fedora 14.4 KiB pkgconf-pkg-config x86_64 2.3.0-2.fc42 fedora 989.0 B popt x86_64 1.19-8.fc42 fedora 132.8 KiB publicsuffix-list-dafsa noarch 20250116-1.fc42 fedora 68.5 KiB pyproject-srpm-macros noarch 1.18.1-1.fc43 fedora 1.9 KiB python-srpm-macros noarch 3.14-1.fc43 fedora 51.7 KiB qt5-srpm-macros noarch 5.15.17-1.fc43 fedora 500.0 B qt6-srpm-macros noarch 6.9.1-1.fc43 fedora 464.0 B readline x86_64 8.2-13.fc43 fedora 485.0 KiB rpm x86_64 5.99.90-6.fc43 fedora 3.1 MiB rpm-build-libs x86_64 5.99.90-6.fc43 fedora 264.4 KiB rpm-libs x86_64 5.99.90-6.fc43 fedora 929.8 KiB rpm-sequoia x86_64 1.8.0-1.fc43 fedora 2.5 MiB rpm-sign-libs x86_64 5.99.90-6.fc43 fedora 39.7 KiB rust-srpm-macros noarch 26.3-4.fc42 fedora 4.8 KiB setup noarch 2.15.0-25.fc43 fedora 725.0 KiB sqlite-libs x86_64 3.50.0-1.fc43 fedora 1.5 MiB systemd-libs x86_64 257.6-1.fc43 fedora 2.2 MiB systemd-standalone-sysusers x86_64 257.6-1.fc43 fedora 277.3 KiB tpm2-tss x86_64 4.1.3-7.fc43 fedora 1.6 MiB tree-sitter-srpm-macros noarch 0.4.1-1.fc43 fedora 8.2 KiB util-linux-core x86_64 2.40.4-8.fc43 fedora 1.4 MiB xxhash-libs x86_64 0.8.3-2.fc42 fedora 90.2 KiB xz-libs x86_64 1:5.8.1-1.fc43 fedora 217.8 KiB zig-srpm-macros noarch 1-4.fc42 fedora 1.1 KiB zip x86_64 3.0-43.fc42 fedora 698.5 KiB zlib-ng-compat x86_64 2.2.4-2.fc43 fedora 137.6 KiB zstd x86_64 1.5.7-1.fc43 fedora 1.7 MiB Installing groups: Buildsystem building group Transaction Summary: Installing: 168 packages Total size of inbound packages is 58 MiB. Need to download 58 MiB. After this operation, 197 MiB extra will be used (install 197 MiB, remove 0 B). [ 1/168] bzip2-0:1.0.8-20.fc42.x86_64 100% | 964.2 KiB/s | 52.1 KiB | 00m00s [ 2/168] cpio-0:2.15-2.fc41.x86_64 100% | 8.9 MiB/s | 291.8 KiB | 00m00s [ 3/168] diffutils-0:3.12-2.fc43.x86_6 100% | 9.4 MiB/s | 392.7 KiB | 00m00s [ 4/168] coreutils-0:9.7-3.fc43.x86_64 100% | 8.6 MiB/s | 1.1 MiB | 00m00s [ 5/168] bash-0:5.2.37-3.fc43.x86_64 100% | 13.3 MiB/s | 1.8 MiB | 00m00s [ 6/168] fedora-release-common-0:43-0. 100% | 2.5 MiB/s | 25.9 KiB | 00m00s [ 7/168] glibc-minimal-langpack-0:2.41 100% | 1.7 MiB/s | 26.7 KiB | 00m00s [ 8/168] findutils-1:4.10.0-5.fc42.x86 100% | 18.6 MiB/s | 551.5 KiB | 00m00s [ 9/168] grep-0:3.12-1.fc43.x86_64 100% | 12.7 MiB/s | 299.5 KiB | 00m00s [ 10/168] gzip-0:1.13-3.fc42.x86_64 100% | 9.8 MiB/s | 170.4 KiB | 00m00s [ 11/168] info-0:7.2-3.fc42.x86_64 100% | 10.6 MiB/s | 183.8 KiB | 00m00s [ 12/168] patch-0:2.8-1.fc43.x86_64 100% | 6.9 MiB/s | 113.7 KiB | 00m00s [ 13/168] redhat-rpm-config-0:343-6.fc4 100% | 5.5 MiB/s | 79.4 KiB | 00m00s [ 14/168] rpm-build-0:5.99.90-6.fc43.x8 100% | 13.0 MiB/s | 132.7 KiB | 00m00s [ 15/168] sed-0:4.9-4.fc42.x86_64 100% | 16.3 MiB/s | 317.3 KiB | 00m00s [ 16/168] shadow-utils-2:4.17.4-1.fc43. 100% | 25.4 MiB/s | 1.3 MiB | 00m00s [ 17/168] unzip-0:6.0-66.fc42.x86_64 100% | 5.0 MiB/s | 184.6 KiB | 00m00s [ 18/168] tar-2:1.35-5.fc42.x86_64 100% | 16.2 MiB/s | 862.5 KiB | 00m00s [ 19/168] which-0:2.23-1.fc42.x86_64 100% | 1.1 MiB/s | 41.7 KiB | 00m00s [ 20/168] xz-1:5.8.1-1.fc43.x86_64 100% | 10.0 MiB/s | 572.5 KiB | 00m00s [ 21/168] util-linux-0:2.40.4-8.fc43.x8 100% | 15.0 MiB/s | 1.2 MiB | 00m00s [ 22/168] gawk-0:5.3.2-1.fc43.x86_64 100% | 10.4 MiB/s | 1.1 MiB | 00m00s [ 23/168] filesystem-0:3.18-44.fc43.x86 100% | 22.2 MiB/s | 1.3 MiB | 00m00s [ 24/168] bzip2-libs-0:1.0.8-20.fc42.x8 100% | 3.9 MiB/s | 43.6 KiB | 00m00s [ 25/168] ncurses-libs-0:6.5-5.20250125 100% | 21.8 MiB/s | 335.0 KiB | 00m00s [ 26/168] glibc-0:2.41.9000-15.fc43.x86 100% | 27.0 MiB/s | 2.2 MiB | 00m00s [ 27/168] gmp-1:6.3.0-3.fc43.x86_64 100% | 4.6 MiB/s | 322.2 KiB | 00m00s [ 28/168] libacl-0:2.3.2-3.fc42.x86_64 100% | 1.9 MiB/s | 23.0 KiB | 00m00s [ 29/168] libattr-0:2.5.2-5.fc42.x86_64 100% | 1.4 MiB/s | 17.1 KiB | 00m00s [ 30/168] libcap-0:2.76-1.fc43.x86_64 100% | 6.5 MiB/s | 86.9 KiB | 00m00s [ 31/168] libselinux-0:3.8-3.fc43.x86_6 100% | 6.3 MiB/s | 96.7 KiB | 00m00s [ 32/168] coreutils-common-0:9.7-3.fc43 100% | 20.4 MiB/s | 2.1 MiB | 00m00s [ 33/168] fedora-repos-0:43-0.2.noarch 100% | 1.1 MiB/s | 9.2 KiB | 00m00s [ 34/168] systemd-libs-0:257.6-1.fc43.x 100% | 27.5 MiB/s | 789.6 KiB | 00m00s [ 35/168] glibc-common-0:2.41.9000-15.f 100% | 18.0 MiB/s | 313.3 KiB | 00m00s [ 36/168] ed-0:1.21-2.fc42.x86_64 100% | 7.3 MiB/s | 82.0 KiB | 00m00s [ 37/168] pcre2-0:10.45-1.fc43.x86_64 100% | 17.1 MiB/s | 262.8 KiB | 00m00s [ 38/168] ansible-srpm-macros-0:1-17.1. 100% | 2.2 MiB/s | 20.3 KiB | 00m00s [ 39/168] build-reproducibility-srpm-ma 100% | 1.4 MiB/s | 11.7 KiB | 00m00s [ 40/168] openssl-libs-1:3.5.0-5.fc43.x 100% | 36.7 MiB/s | 2.6 MiB | 00m00s [ 41/168] efi-srpm-macros-0:6-3.fc43.no 100% | 1.8 MiB/s | 22.5 KiB | 00m00s [ 42/168] dwz-0:0.16-1.fc43.x86_64 100% | 8.3 MiB/s | 135.5 KiB | 00m00s [ 43/168] file-0:5.46-5.fc43.x86_64 100% | 4.8 MiB/s | 48.8 KiB | 00m00s [ 44/168] filesystem-srpm-macros-0:3.18 100% | 2.5 MiB/s | 26.0 KiB | 00m00s [ 45/168] fonts-srpm-macros-1:2.0.5-22. 100% | 3.3 MiB/s | 27.2 KiB | 00m00s [ 46/168] fpc-srpm-macros-0:1.3-14.fc42 100% | 1.1 MiB/s | 8.0 KiB | 00m00s [ 47/168] forge-srpm-macros-0:0.4.0-2.f 100% | 2.4 MiB/s | 19.9 KiB | 00m00s [ 48/168] ghc-srpm-macros-0:1.9.2-2.fc4 100% | 1.1 MiB/s | 9.2 KiB | 00m00s [ 49/168] gnat-srpm-macros-0:6-7.fc42.n 100% | 1.2 MiB/s | 8.6 KiB | 00m00s [ 50/168] java-srpm-macros-0:1-4.fc43.n 100% | 1.1 MiB/s | 7.7 KiB | 00m00s [ 51/168] go-srpm-macros-0:3.6.0-7.fc43 100% | 3.4 MiB/s | 27.6 KiB | 00m00s [ 52/168] kernel-srpm-macros-0:1.0-25.f 100% | 1.2 MiB/s | 9.9 KiB | 00m00s [ 53/168] lua-srpm-macros-0:1-15.fc42.n 100% | 1.2 MiB/s | 8.9 KiB | 00m00s [ 54/168] ocaml-srpm-macros-0:10-4.fc42 100% | 1.1 MiB/s | 9.2 KiB | 00m00s [ 55/168] package-notes-srpm-macros-0:0 100% | 1.3 MiB/s | 9.3 KiB | 00m00s [ 56/168] openblas-srpm-macros-0:2-19.f 100% | 970.7 KiB/s | 7.8 KiB | 00m00s [ 57/168] perl-srpm-macros-0:1-57.fc42. 100% | 1.0 MiB/s | 8.5 KiB | 00m00s [ 58/168] python-srpm-macros-0:3.14-1.f 100% | 2.8 MiB/s | 23.2 KiB | 00m00s [ 59/168] pyproject-srpm-macros-0:1.18. 100% | 1.7 MiB/s | 13.9 KiB | 00m00s [ 60/168] qt5-srpm-macros-0:5.15.17-1.f 100% | 1.1 MiB/s | 8.7 KiB | 00m00s [ 61/168] qt6-srpm-macros-0:6.9.1-1.fc4 100% | 1.1 MiB/s | 9.4 KiB | 00m00s [ 62/168] rust-srpm-macros-0:26.3-4.fc4 100% | 1.4 MiB/s | 11.7 KiB | 00m00s [ 63/168] tree-sitter-srpm-macros-0:0.4 100% | 1.3 MiB/s | 13.0 KiB | 00m00s [ 64/168] zig-srpm-macros-0:1-4.fc42.no 100% | 824.4 KiB/s | 8.2 KiB | 00m00s [ 65/168] rpm-0:5.99.90-6.fc43.x86_64 100% | 24.6 MiB/s | 553.2 KiB | 00m00s [ 66/168] debugedit-0:5.1-6.fc43.x86_64 100% | 8.5 MiB/s | 78.7 KiB | 00m00s [ 67/168] zip-0:3.0-43.fc42.x86_64 100% | 21.4 MiB/s | 263.5 KiB | 00m00s [ 68/168] elfutils-0:0.193-2.fc43.x86_6 100% | 21.5 MiB/s | 571.5 KiB | 00m00s [ 69/168] elfutils-libelf-0:0.193-2.fc4 100% | 10.7 MiB/s | 207.9 KiB | 00m00s [ 70/168] libarchive-0:3.8.1-1.fc43.x86 100% | 18.7 MiB/s | 421.4 KiB | 00m00s [ 71/168] popt-0:1.19-8.fc42.x86_64 100% | 4.6 MiB/s | 66.0 KiB | 00m00s [ 72/168] readline-0:8.2-13.fc43.x86_64 100% | 11.6 MiB/s | 212.9 KiB | 00m00s [ 73/168] rpm-build-libs-0:5.99.90-6.fc 100% | 2.3 MiB/s | 126.8 KiB | 00m00s [ 74/168] rpm-libs-0:5.99.90-6.fc43.x86 100% | 7.7 MiB/s | 399.7 KiB | 00m00s [ 75/168] zstd-0:1.5.7-1.fc43.x86_64 100% | 9.1 MiB/s | 485.8 KiB | 00m00s [ 76/168] audit-libs-0:4.0.5-1.fc43.x86 100% | 10.6 MiB/s | 130.8 KiB | 00m00s [ 77/168] libeconf-0:0.7.9-1.fc43.x86_6 100% | 3.4 MiB/s | 35.2 KiB | 00m00s [ 78/168] libsemanage-0:3.8.1-3.fc43.x8 100% | 8.6 MiB/s | 123.3 KiB | 00m00s [ 79/168] libxcrypt-0:4.4.38-7.fc43.x86 100% | 8.3 MiB/s | 127.2 KiB | 00m00s [ 80/168] pam-libs-0:1.7.0-4.fc42.x86_6 100% | 4.4 MiB/s | 58.3 KiB | 00m00s [ 81/168] setup-0:2.15.0-25.fc43.noarch 100% | 11.8 MiB/s | 157.6 KiB | 00m00s [ 82/168] xz-libs-1:5.8.1-1.fc43.x86_64 100% | 8.5 MiB/s | 113.0 KiB | 00m00s [ 83/168] mpfr-0:4.2.2-1.fc43.x86_64 100% | 16.1 MiB/s | 346.7 KiB | 00m00s [ 84/168] libblkid-0:2.40.4-8.fc43.x86_ 100% | 8.0 MiB/s | 122.5 KiB | 00m00s [ 85/168] libcap-ng-0:0.8.5-5.fc43.x86_ 100% | 2.2 MiB/s | 32.2 KiB | 00m00s [ 86/168] libfdisk-0:2.40.4-8.fc43.x86_ 100% | 9.1 MiB/s | 158.4 KiB | 00m00s [ 87/168] libmount-0:2.40.4-8.fc43.x86_ 100% | 9.4 MiB/s | 154.4 KiB | 00m00s [ 88/168] libsmartcols-0:2.40.4-8.fc43. 100% | 5.3 MiB/s | 81.6 KiB | 00m00s [ 89/168] libuuid-0:2.40.4-8.fc43.x86_6 100% | 3.2 MiB/s | 25.9 KiB | 00m00s [ 90/168] zlib-ng-compat-0:2.2.4-2.fc43 100% | 7.0 MiB/s | 79.1 KiB | 00m00s [ 91/168] util-linux-core-0:2.40.4-8.fc 100% | 22.5 MiB/s | 529.3 KiB | 00m00s [ 92/168] ncurses-base-0:6.5-5.20250125 100% | 5.4 MiB/s | 88.1 KiB | 00m00s [ 93/168] libsepol-0:3.8-1.fc42.x86_64 100% | 4.2 MiB/s | 348.9 KiB | 00m00s [ 94/168] glibc-gconv-extra-0:2.41.9000 100% | 15.4 MiB/s | 1.6 MiB | 00m00s [ 95/168] fedora-gpg-keys-0:43-0.2.noar 100% | 6.7 MiB/s | 136.6 KiB | 00m00s [ 96/168] crypto-policies-0:20250603-1. 100% | 3.1 MiB/s | 97.7 KiB | 00m00s [ 97/168] ca-certificates-0:2024.2.69_v 100% | 8.5 MiB/s | 945.0 KiB | 00m00s [ 98/168] fedora-repos-rawhide-0:43-0.2 100% | 1.2 MiB/s | 8.8 KiB | 00m00s [ 99/168] pcre2-syntax-0:10.45-1.fc43.n 100% | 12.1 MiB/s | 161.7 KiB | 00m00s [100/168] curl-0:8.14.1-1.fc43.x86_64 100% | 4.8 MiB/s | 234.3 KiB | 00m00s [101/168] file-libs-0:5.46-5.fc43.x86_6 100% | 13.8 MiB/s | 849.8 KiB | 00m00s [102/168] add-determinism-0:0.6.0-1.fc4 100% | 14.5 MiB/s | 918.3 KiB | 00m00s [103/168] elfutils-debuginfod-client-0: 100% | 5.7 MiB/s | 47.0 KiB | 00m00s [104/168] elfutils-libs-0:0.193-2.fc43. 100% | 15.5 MiB/s | 270.2 KiB | 00m00s [105/168] libzstd-0:1.5.7-1.fc43.x86_64 100% | 14.6 MiB/s | 314.8 KiB | 00m00s [106/168] lz4-libs-0:1.10.0-2.fc42.x86_ 100% | 5.4 MiB/s | 78.1 KiB | 00m00s [107/168] lua-libs-0:5.4.8-1.fc43.x86_6 100% | 7.2 MiB/s | 131.9 KiB | 00m00s [108/168] libxml2-0:2.12.10-2.fc43.x86_ 100% | 20.5 MiB/s | 691.3 KiB | 00m00s [109/168] rpm-sign-libs-0:5.99.90-6.fc4 100% | 2.0 MiB/s | 28.6 KiB | 00m00s [110/168] elfutils-default-yama-scope-0 100% | 968.1 KiB/s | 12.6 KiB | 00m00s [111/168] json-c-0:0.18-2.fc42.x86_64 100% | 2.6 MiB/s | 44.9 KiB | 00m00s [112/168] sqlite-libs-0:3.50.0-1.fc43.x 100% | 16.9 MiB/s | 761.3 KiB | 00m00s [113/168] rpm-sequoia-0:1.8.0-1.fc43.x8 100% | 19.9 MiB/s | 938.8 KiB | 00m00s [114/168] libfsverity-0:1.6-2.fc42.x86_ 100% | 481.6 KiB/s | 18.8 KiB | 00m00s [115/168] ima-evm-utils-libs-0:1.6.2-5. 100% | 720.5 KiB/s | 29.5 KiB | 00m00s [116/168] gpgverify-0:2.1-3.fc43.noarch 100% | 1.5 MiB/s | 10.8 KiB | 00m00s [117/168] gnupg2-dirmngr-0:2.4.8-2.fc43 100% | 15.8 MiB/s | 274.8 KiB | 00m00s [118/168] gnupg2-gpg-agent-0:2.4.8-2.fc 100% | 17.8 MiB/s | 273.0 KiB | 00m00s [119/168] gnupg2-gpgconf-0:2.4.8-2.fc43 100% | 10.2 MiB/s | 115.2 KiB | 00m00s [120/168] gnupg2-keyboxd-0:2.4.8-2.fc43 100% | 10.3 MiB/s | 94.8 KiB | 00m00s [121/168] gnupg2-0:2.4.8-2.fc43.x86_64 100% | 18.5 MiB/s | 1.6 MiB | 00m00s [122/168] libassuan-0:2.5.7-3.fc42.x86_ 100% | 8.2 MiB/s | 67.6 KiB | 00m00s [123/168] gnupg2-verify-0:2.4.8-2.fc43. 100% | 12.9 MiB/s | 171.3 KiB | 00m00s [124/168] npth-0:1.8-2.fc42.x86_64 100% | 3.2 MiB/s | 25.8 KiB | 00m00s [125/168] libgpg-error-0:1.55-1.fc43.x8 100% | 17.0 MiB/s | 244.1 KiB | 00m00s [126/168] libgcrypt-0:1.11.1-1.fc43.x86 100% | 30.6 MiB/s | 596.1 KiB | 00m00s [127/168] libksba-0:1.6.7-3.fc42.x86_64 100% | 15.8 MiB/s | 162.0 KiB | 00m00s [128/168] tpm2-tss-0:4.1.3-7.fc43.x86_6 100% | 19.7 MiB/s | 423.4 KiB | 00m00s [129/168] openldap-0:2.6.10-1.fc43.x86_ 100% | 21.1 MiB/s | 259.2 KiB | 00m00s [130/168] libusb1-0:1.0.28-2.fc43.x86_6 100% | 8.6 MiB/s | 79.3 KiB | 00m00s [131/168] libidn2-0:2.3.8-1.fc43.x86_64 100% | 17.1 MiB/s | 174.8 KiB | 00m00s [132/168] gnutls-0:3.8.9-5.fc43.x86_64 100% | 33.3 MiB/s | 1.2 MiB | 00m00s [133/168] libtasn1-0:4.20.0-1.fc43.x86_ 100% | 7.3 MiB/s | 75.0 KiB | 00m00s [134/168] nettle-0:3.10.1-1.fc43.x86_64 100% | 24.4 MiB/s | 424.6 KiB | 00m00s [135/168] libunistring-0:1.1-9.fc42.x86 100% | 26.5 MiB/s | 542.5 KiB | 00m00s [136/168] p11-kit-0:0.25.5-8.fc43.x86_6 100% | 22.7 MiB/s | 488.2 KiB | 00m00s [137/168] libtool-ltdl-0:2.5.4-4.fc42.x 100% | 3.9 MiB/s | 36.2 KiB | 00m00s [138/168] libevent-0:2.1.12-15.fc42.x86 100% | 19.5 MiB/s | 260.2 KiB | 00m00s [139/168] cyrus-sasl-lib-0:2.1.28-30.fc 100% | 31.0 MiB/s | 793.5 KiB | 00m00s [140/168] libffi-0:3.5.1-1.fc43.x86_64 100% | 3.6 MiB/s | 40.9 KiB | 00m00s [141/168] gdbm-libs-1:1.23-9.fc42.x86_6 100% | 5.6 MiB/s | 57.0 KiB | 00m00s [142/168] libgcc-0:15.1.1-2.fc43.x86_64 100% | 2.5 MiB/s | 128.0 KiB | 00m00s [143/168] libgomp-0:15.1.1-2.fc43.x86_6 100% | 4.7 MiB/s | 365.7 KiB | 00m00s [144/168] libstdc++-0:15.1.1-2.fc43.x86 100% | 11.1 MiB/s | 913.3 KiB | 00m00s [145/168] alternatives-0:1.33-1.fc43.x8 100% | 1.4 MiB/s | 40.5 KiB | 00m00s [146/168] jansson-0:2.14-2.fc42.x86_64 100% | 1.9 MiB/s | 45.7 KiB | 00m00s [147/168] pkgconf-pkg-config-0:2.3.0-2. 100% | 1.2 MiB/s | 9.9 KiB | 00m00s [148/168] pkgconf-0:2.3.0-2.fc42.x86_64 100% | 5.5 MiB/s | 44.9 KiB | 00m00s [149/168] pkgconf-m4-0:2.3.0-2.fc42.noa 100% | 2.0 MiB/s | 14.2 KiB | 00m00s [150/168] libpkgconf-0:2.3.0-2.fc42.x86 100% | 661.5 KiB/s | 38.4 KiB | 00m00s [151/168] p11-kit-trust-0:0.25.5-8.fc43 100% | 2.5 MiB/s | 132.4 KiB | 00m00s [152/168] fedora-release-0:43-0.16.noar 100% | 1.8 MiB/s | 14.9 KiB | 00m00s [153/168] systemd-standalone-sysusers-0 100% | 11.9 MiB/s | 134.4 KiB | 00m00s [154/168] xxhash-libs-0:0.8.3-2.fc42.x8 100% | 4.2 MiB/s | 39.1 KiB | 00m00s [155/168] fedora-release-identity-basic 100% | 1.3 MiB/s | 15.6 KiB | 00m00s [156/168] binutils-0:2.44-3.fc43.x86_64 100% | 33.6 MiB/s | 5.8 MiB | 00m00s [157/168] libcurl-0:8.14.1-1.fc43.x86_6 100% | 7.0 MiB/s | 400.0 KiB | 00m00s [158/168] krb5-libs-0:1.21.3-6.fc43.x86 100% | 16.9 MiB/s | 759.5 KiB | 00m00s [159/168] libnghttp2-0:1.65.0-1.fc43.x8 100% | 7.9 MiB/s | 72.6 KiB | 00m00s [160/168] libbrotli-0:1.1.0-7.fc43.x86_ 100% | 23.7 MiB/s | 339.1 KiB | 00m00s [161/168] libpsl-0:0.21.5-5.fc42.x86_64 100% | 6.3 MiB/s | 64.0 KiB | 00m00s [162/168] libssh-0:0.11.1-4.fc42.x86_64 100% | 19.0 MiB/s | 233.3 KiB | 00m00s [163/168] keyutils-libs-0:1.6.3-5.fc42. 100% | 3.8 MiB/s | 31.5 KiB | 00m00s [164/168] libcom_err-0:1.47.2-3.fc42.x8 100% | 3.3 MiB/s | 26.9 KiB | 00m00s [165/168] libverto-0:0.3.2-10.fc42.x86_ 100% | 2.5 MiB/s | 20.8 KiB | 00m00s [166/168] publicsuffix-list-dafsa-0:202 100% | 7.2 MiB/s | 58.8 KiB | 00m00s [167/168] gdb-minimal-0:16.3-3.fc43.x86 100% | 32.4 MiB/s | 4.4 MiB | 00m00s [168/168] libssh-config-0:0.11.1-4.fc42 100% | 750.2 KiB/s | 9.0 KiB | 00m00s -------------------------------------------------------------------------------- [168/168] Total 100% | 36.7 MiB/s | 58.4 MiB | 00m02s Running transaction Importing OpenPGP key 0x31645531: UserID : "Fedora (43) " Fingerprint: C6E7F081CF80E13146676E88829B606631645531 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-43-primary The key was successfully imported. Importing OpenPGP key 0x31645531: UserID : "Fedora (43) " Fingerprint: C6E7F081CF80E13146676E88829B606631645531 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-43-primary The key was successfully imported. Importing OpenPGP key 0x105EF944: UserID : "Fedora (42) " Fingerprint: B0F4950458F69E1150C6C5EDC8AC4916105EF944 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-42-primary The key was successfully imported. Importing OpenPGP key 0x6D9F90A6: UserID : "Fedora (44) " Fingerprint: 36F612DCF27F7D1A48A835E4DBFCF71C6D9F90A6 From : file:///usr/share/distribution-gpg-keys/fedora/RPM-GPG-KEY-fedora-44-primary The key was successfully imported. [ 1/170] Verify package files 100% | 1.4 KiB/s | 168.0 B | 00m00s >>> Running pre-transaction scriptlet: filesystem-0:3.18-44.fc43.x86_64 >>> Finished pre-transaction scriptlet: filesystem-0:3.18-44.fc43.x86_64 >>> [RPM] /var/lib/mock/fedora-rawhide-x86_64-1750253281.853958/root/var/cache/d [ 2/170] Prepare transaction 100% | 2.0 KiB/s | 168.0 B | 00m00s [ 3/170] Installing libgcc-0:15.1.1-2. 100% | 131.0 MiB/s | 268.3 KiB | 00m00s [ 4/170] Installing libssh-config-0:0. 100% | 796.9 KiB/s | 816.0 B | 00m00s [ 5/170] Installing publicsuffix-list- 100% | 67.6 MiB/s | 69.2 KiB | 00m00s [ 6/170] Installing fedora-release-ide 100% | 898.4 KiB/s | 920.0 B | 00m00s [ 7/170] Installing fedora-gpg-keys-0: 100% | 19.1 MiB/s | 175.9 KiB | 00m00s [ 8/170] Installing fedora-repos-rawhi 100% | 0.0 B/s | 2.4 KiB | 00m00s [ 9/170] Installing fedora-repos-0:43- 100% | 5.6 MiB/s | 5.7 KiB | 00m00s [ 10/170] Installing fedora-release-com 100% | 12.1 MiB/s | 24.7 KiB | 00m00s [ 11/170] Installing fedora-release-0:4 100% | 7.6 KiB/s | 124.0 B | 00m00s >>> Running sysusers scriptlet: setup-0:2.15.0-25.fc43.noarch >>> Finished sysusers scriptlet: setup-0:2.15.0-25.fc43.noarch >>> Scriptlet output: >>> Creating group 'adm' with GID 4. >>> Creating group 'audio' with GID 63. >>> Creating group 'cdrom' with GID 11. >>> Creating group 'clock' with GID 103. >>> Creating group 'dialout' with GID 18. >>> Creating group 'disk' with GID 6. >>> Creating group 'floppy' with GID 19. >>> Creating group 'ftp' with GID 50. >>> Creating group 'games' with GID 20. >>> Creating group 'input' with GID 104. >>> Creating group 'kmem' with GID 9. >>> Creating group 'kvm' with GID 36. >>> Creating group 'lock' with GID 54. >>> Creating group 'lp' with GID 7. >>> Creating group 'mail' with GID 12. >>> Creating group 'man' with GID 15. >>> Creating group 'mem' with GID 8. >>> Creating group 'nobody' with GID 65534. >>> Creating group 'render' with GID 105. >>> Creating group 'root' with GID 0. >>> Creating group 'sgx' with GID 106. >>> Creating group 'sys' with GID 3. >>> Creating group 'tape' with GID 33. >>> Creating group 'tty' with GID 5. >>> Creating group 'users' with GID 100. >>> Creating group 'utmp' with GID 22. >>> Creating group 'video' with GID 39. >>> Creating group 'wheel' with GID 10. >>> Creating user 'adm' (adm) with UID 3 and GID 4. >>> Creating group 'bin' with GID 1. >>> Creating user 'bin' (bin) with UID 1 and GID 1. >>> Creating group 'daemon' with GID 2. >>> Creating user 'daemon' (daemon) with UID 2 and GID 2. >>> Creating user 'ftp' (FTP User) with UID 14 and GID 50. >>> Creating user 'games' (games) with UID 12 and GID 100. >>> Creating user 'halt' (halt) with UID 7 and GID 0. >>> Creating user 'lp' (lp) with UID 4 and GID 7. >>> Creating user 'mail' (mail) with UID 8 and GID 12. >>> Creating user 'nobody' (Kernel Overflow User) with UID 65534 and GID 65534. >>> Creating user 'operator' (operator) with UID 11 and GID 0. >>> Creating user 'root' (Super User) with UID 0 and GID 0. >>> Creating user 'shutdown' (shutdown) with UID 6 and GID 0. >>> Creating user 'sync' (sync) with UID 5 and GID 0. >>> [ 12/170] Installing setup-0:2.15.0-25. 100% | 35.7 MiB/s | 730.6 KiB | 00m00s >>> [RPM] /etc/hosts created as /etc/hosts.rpmnew [ 13/170] Installing filesystem-0:3.18- 100% | 1.4 MiB/s | 212.5 KiB | 00m00s [ 14/170] Installing pkgconf-m4-0:2.3.0 100% | 14.5 MiB/s | 14.8 KiB | 00m00s [ 15/170] Installing pcre2-syntax-0:10. 100% | 135.0 MiB/s | 276.4 KiB | 00m00s [ 16/170] Installing ncurses-base-0:6.5 100% | 34.4 MiB/s | 352.2 KiB | 00m00s [ 17/170] Installing bash-0:5.2.37-3.fc 100% | 194.8 MiB/s | 8.2 MiB | 00m00s [ 18/170] Installing glibc-common-0:2.4 100% | 53.7 MiB/s | 1.0 MiB | 00m00s [ 19/170] Installing glibc-gconv-extra- 100% | 146.2 MiB/s | 7.3 MiB | 00m00s [ 20/170] Installing glibc-0:2.41.9000- 100% | 142.1 MiB/s | 6.7 MiB | 00m00s [ 21/170] Installing ncurses-libs-0:6.5 100% | 186.1 MiB/s | 952.8 KiB | 00m00s [ 22/170] Installing glibc-minimal-lang 100% | 0.0 B/s | 124.0 B | 00m00s [ 23/170] Installing zlib-ng-compat-0:2 100% | 135.2 MiB/s | 138.4 KiB | 00m00s [ 24/170] Installing bzip2-libs-0:1.0.8 100% | 83.7 MiB/s | 85.7 KiB | 00m00s [ 25/170] Installing libgpg-error-0:1.5 100% | 50.0 MiB/s | 921.1 KiB | 00m00s [ 26/170] Installing libstdc++-0:15.1.1 100% | 257.8 MiB/s | 2.8 MiB | 00m00s [ 27/170] Installing xz-libs-1:5.8.1-1. 100% | 213.8 MiB/s | 218.9 KiB | 00m00s [ 28/170] Installing libassuan-0:2.5.7- 100% | 165.6 MiB/s | 169.6 KiB | 00m00s [ 29/170] Installing libgcrypt-0:1.11.1 100% | 262.5 MiB/s | 1.6 MiB | 00m00s [ 30/170] Installing readline-0:8.2-13. 100% | 237.8 MiB/s | 487.1 KiB | 00m00s [ 31/170] Installing gmp-1:6.3.0-3.fc43 100% | 200.6 MiB/s | 821.5 KiB | 00m00s [ 32/170] Installing libuuid-0:2.40.4-8 100% | 37.4 MiB/s | 38.3 KiB | 00m00s [ 33/170] Installing popt-0:1.19-8.fc42 100% | 34.0 MiB/s | 139.4 KiB | 00m00s [ 34/170] Installing npth-0:1.8-2.fc42. 100% | 49.5 MiB/s | 50.7 KiB | 00m00s [ 35/170] Installing libblkid-0:2.40.4- 100% | 128.6 MiB/s | 263.4 KiB | 00m00s [ 36/170] Installing libxcrypt-0:4.4.38 100% | 140.2 MiB/s | 287.2 KiB | 00m00s [ 37/170] Installing libzstd-0:1.5.7-1. 100% | 263.4 MiB/s | 809.1 KiB | 00m00s [ 38/170] Installing elfutils-libelf-0: 100% | 233.3 MiB/s | 1.2 MiB | 00m00s [ 39/170] Installing gnupg2-gpgconf-0:2 100% | 17.6 MiB/s | 252.1 KiB | 00m00s [ 40/170] Installing libattr-0:2.5.2-5. 100% | 27.4 MiB/s | 28.1 KiB | 00m00s [ 41/170] Installing libacl-0:2.3.2-3.f 100% | 38.2 MiB/s | 39.2 KiB | 00m00s [ 42/170] Installing sqlite-libs-0:3.50 100% | 252.7 MiB/s | 1.5 MiB | 00m00s [ 43/170] Installing libtasn1-0:4.20.0- 100% | 173.9 MiB/s | 178.1 KiB | 00m00s [ 44/170] Installing libunistring-0:1.1 100% | 287.8 MiB/s | 1.7 MiB | 00m00s [ 45/170] Installing libidn2-0:2.3.8-1. 100% | 28.7 MiB/s | 558.7 KiB | 00m00s [ 46/170] Installing crypto-policies-0: 100% | 16.3 MiB/s | 167.3 KiB | 00m00s [ 47/170] Installing dwz-0:0.16-1.fc43. 100% | 18.8 MiB/s | 288.5 KiB | 00m00s [ 48/170] Installing mpfr-0:4.2.2-1.fc4 100% | 202.7 MiB/s | 830.4 KiB | 00m00s [ 49/170] Installing gawk-0:5.3.2-1.fc4 100% | 79.0 MiB/s | 1.8 MiB | 00m00s [ 50/170] Installing libksba-0:1.6.7-3. 100% | 197.8 MiB/s | 405.1 KiB | 00m00s [ 51/170] Installing unzip-0:6.0-66.fc4 100% | 27.5 MiB/s | 393.8 KiB | 00m00s [ 52/170] Installing file-libs-0:5.46-5 100% | 474.3 MiB/s | 11.9 MiB | 00m00s [ 53/170] Installing file-0:5.46-5.fc43 100% | 7.6 MiB/s | 101.7 KiB | 00m00s [ 54/170] Installing pcre2-0:10.45-1.fc 100% | 227.6 MiB/s | 699.1 KiB | 00m00s [ 55/170] Installing grep-0:3.12-1.fc43 100% | 50.1 MiB/s | 1.0 MiB | 00m00s [ 56/170] Installing xz-1:5.8.1-1.fc43. 100% | 57.9 MiB/s | 1.3 MiB | 00m00s [ 57/170] Installing libeconf-0:0.7.9-1 100% | 65.0 MiB/s | 66.5 KiB | 00m00s [ 58/170] Installing libcap-ng-0:0.8.5- 100% | 69.2 MiB/s | 70.8 KiB | 00m00s [ 59/170] Installing audit-libs-0:4.0.5 100% | 172.6 MiB/s | 353.4 KiB | 00m00s [ 60/170] Installing pam-libs-0:1.7.0-4 100% | 63.1 MiB/s | 129.1 KiB | 00m00s [ 61/170] Installing libcap-0:2.76-1.fc 100% | 14.0 MiB/s | 214.3 KiB | 00m00s [ 62/170] Installing systemd-libs-0:257 100% | 248.0 MiB/s | 2.2 MiB | 00m00s [ 63/170] Installing libsmartcols-0:2.4 100% | 173.4 MiB/s | 177.5 KiB | 00m00s [ 64/170] Installing libsepol-0:3.8-1.f 100% | 269.2 MiB/s | 827.0 KiB | 00m00s [ 65/170] Installing libselinux-0:3.8-3 100% | 94.9 MiB/s | 194.3 KiB | 00m00s [ 66/170] Installing findutils-1:4.10.0 100% | 85.2 MiB/s | 1.9 MiB | 00m00s [ 67/170] Installing sed-0:4.9-4.fc42.x 100% | 44.5 MiB/s | 865.5 KiB | 00m00s [ 68/170] Installing libmount-0:2.40.4- 100% | 174.5 MiB/s | 357.4 KiB | 00m00s [ 69/170] Installing lz4-libs-0:1.10.0- 100% | 154.7 MiB/s | 158.5 KiB | 00m00s [ 70/170] Installing lua-libs-0:5.4.8-1 100% | 137.7 MiB/s | 282.0 KiB | 00m00s [ 71/170] Installing json-c-0:0.18-2.fc 100% | 85.9 MiB/s | 88.0 KiB | 00m00s [ 72/170] Installing libffi-0:3.5.1-1.f 100% | 83.0 MiB/s | 85.0 KiB | 00m00s [ 73/170] Installing p11-kit-0:0.25.5-8 100% | 80.9 MiB/s | 2.2 MiB | 00m00s [ 74/170] Installing alternatives-0:1.3 100% | 5.2 MiB/s | 63.8 KiB | 00m00s [ 75/170] Installing p11-kit-trust-0:0. 100% | 13.4 MiB/s | 397.1 KiB | 00m00s [ 76/170] Installing zstd-0:1.5.7-1.fc4 100% | 81.4 MiB/s | 1.7 MiB | 00m00s [ 77/170] Installing util-linux-core-0: 100% | 62.0 MiB/s | 1.4 MiB | 00m00s [ 78/170] Installing tar-2:1.35-5.fc42. 100% | 113.9 MiB/s | 3.0 MiB | 00m00s [ 79/170] Installing libsemanage-0:3.8. 100% | 149.5 MiB/s | 306.2 KiB | 00m00s [ 80/170] Installing systemd-standalone 100% | 20.9 MiB/s | 278.0 KiB | 00m00s [ 81/170] Installing libusb1-0:1.0.28-2 100% | 84.3 MiB/s | 172.7 KiB | 00m00s [ 82/170] Installing zip-0:3.0-43.fc42. 100% | 45.7 MiB/s | 702.4 KiB | 00m00s [ 83/170] Installing gnupg2-keyboxd-0:2 100% | 13.2 MiB/s | 202.7 KiB | 00m00s [ 84/170] Installing libpsl-0:0.21.5-5. 100% | 75.7 MiB/s | 77.5 KiB | 00m00s [ 85/170] Installing libfdisk-0:2.40.4- 100% | 182.3 MiB/s | 373.4 KiB | 00m00s [ 86/170] Installing gnupg2-verify-0:2. 100% | 21.4 MiB/s | 349.9 KiB | 00m00s [ 87/170] Installing nettle-0:3.10.1-1. 100% | 193.8 MiB/s | 793.6 KiB | 00m00s [ 88/170] Installing gnutls-0:3.8.9-5.f 100% | 238.2 MiB/s | 3.6 MiB | 00m00s [ 89/170] Installing libxml2-0:2.12.10- 100% | 89.7 MiB/s | 1.7 MiB | 00m00s [ 90/170] Installing bzip2-0:1.0.8-20.f 100% | 7.8 MiB/s | 103.8 KiB | 00m00s [ 91/170] Installing add-determinism-0: 100% | 117.4 MiB/s | 2.5 MiB | 00m00s [ 92/170] Installing build-reproducibil 100% | 0.0 B/s | 1.0 KiB | 00m00s [ 93/170] Installing cpio-0:2.15-2.fc41 100% | 57.9 MiB/s | 1.1 MiB | 00m00s [ 94/170] Installing diffutils-0:3.12-2 100% | 74.3 MiB/s | 1.6 MiB | 00m00s [ 95/170] Installing ed-0:1.21-2.fc42.x 100% | 10.4 MiB/s | 148.8 KiB | 00m00s [ 96/170] Installing patch-0:2.8-1.fc43 100% | 15.9 MiB/s | 228.3 KiB | 00m00s [ 97/170] Installing libtool-ltdl-0:2.5 100% | 69.6 MiB/s | 71.2 KiB | 00m00s [ 98/170] Installing gdbm-libs-1:1.23-9 100% | 64.2 MiB/s | 131.6 KiB | 00m00s [ 99/170] Installing cyrus-sasl-lib-0:2 100% | 109.7 MiB/s | 2.3 MiB | 00m00s [100/170] Installing libgomp-0:15.1.1-2 100% | 263.9 MiB/s | 540.5 KiB | 00m00s [101/170] Installing jansson-0:2.14-2.f 100% | 92.2 MiB/s | 94.4 KiB | 00m00s [102/170] Installing libpkgconf-0:2.3.0 100% | 77.4 MiB/s | 79.2 KiB | 00m00s [103/170] Installing pkgconf-0:2.3.0-2. 100% | 6.3 MiB/s | 91.0 KiB | 00m00s [104/170] Installing pkgconf-pkg-config 100% | 147.8 KiB/s | 1.8 KiB | 00m00s [105/170] Installing xxhash-libs-0:0.8. 100% | 89.4 MiB/s | 91.6 KiB | 00m00s [106/170] Installing libbrotli-0:1.1.0- 100% | 204.0 MiB/s | 835.6 KiB | 00m00s [107/170] Installing libnghttp2-0:1.65. 100% | 159.5 MiB/s | 163.3 KiB | 00m00s [108/170] Installing keyutils-libs-0:1. 100% | 58.3 MiB/s | 59.7 KiB | 00m00s [109/170] Installing libcom_err-0:1.47. 100% | 66.6 MiB/s | 68.2 KiB | 00m00s [110/170] Installing libverto-0:0.3.2-1 100% | 26.6 MiB/s | 27.2 KiB | 00m00s [111/170] Installing filesystem-srpm-ma 100% | 38.0 MiB/s | 38.9 KiB | 00m00s [112/170] Installing elfutils-default-y 100% | 170.2 KiB/s | 2.0 KiB | 00m00s [113/170] Installing elfutils-libs-0:0. 100% | 167.3 MiB/s | 685.2 KiB | 00m00s [114/170] Installing rust-srpm-macros-0 100% | 5.4 MiB/s | 5.6 KiB | 00m00s [115/170] Installing qt6-srpm-macros-0: 100% | 0.0 B/s | 740.0 B | 00m00s [116/170] Installing qt5-srpm-macros-0: 100% | 0.0 B/s | 776.0 B | 00m00s [117/170] Installing perl-srpm-macros-0 100% | 0.0 B/s | 1.1 KiB | 00m00s [118/170] Installing package-notes-srpm 100% | 0.0 B/s | 2.0 KiB | 00m00s [119/170] Installing openblas-srpm-macr 100% | 0.0 B/s | 392.0 B | 00m00s [120/170] Installing ocaml-srpm-macros- 100% | 0.0 B/s | 2.2 KiB | 00m00s [121/170] Installing kernel-srpm-macros 100% | 0.0 B/s | 2.3 KiB | 00m00s [122/170] Installing gnat-srpm-macros-0 100% | 0.0 B/s | 1.3 KiB | 00m00s [123/170] Installing ghc-srpm-macros-0: 100% | 0.0 B/s | 1.0 KiB | 00m00s [124/170] Installing fpc-srpm-macros-0: 100% | 0.0 B/s | 420.0 B | 00m00s [125/170] Installing ansible-srpm-macro 100% | 35.4 MiB/s | 36.2 KiB | 00m00s [126/170] Installing coreutils-common-0 100% | 245.5 MiB/s | 11.3 MiB | 00m00s [127/170] Installing openssl-libs-1:3.5 100% | 317.3 MiB/s | 8.9 MiB | 00m00s [128/170] Installing coreutils-0:9.7-3. 100% | 104.7 MiB/s | 5.4 MiB | 00m00s [129/170] Installing ca-certificates-0: 100% | 1.2 MiB/s | 2.4 MiB | 00m02s [130/170] Installing libarchive-0:3.8.1 100% | 186.1 MiB/s | 953.1 KiB | 00m00s [131/170] Installing krb5-libs-0:1.21.3 100% | 84.9 MiB/s | 2.3 MiB | 00m00s >>> Running sysusers scriptlet: tpm2-tss-0:4.1.3-7.fc43.x86_64 >>> Finished sysusers scriptlet: tpm2-tss-0:4.1.3-7.fc43.x86_64 >>> Scriptlet output: >>> Creating group 'tss' with GID 59. >>> Creating user 'tss' (Account used for TPM access) with UID 59 and GID 59. >>> [132/170] Installing tpm2-tss-0:4.1.3-7 100% | 174.2 MiB/s | 1.6 MiB | 00m00s [133/170] Installing ima-evm-utils-libs 100% | 60.5 MiB/s | 62.0 KiB | 00m00s [134/170] Installing gnupg2-gpg-agent-0 100% | 20.6 MiB/s | 675.4 KiB | 00m00s [135/170] Installing libssh-0:0.11.1-4. 100% | 138.6 MiB/s | 567.5 KiB | 00m00s [136/170] Installing gzip-0:1.13-3.fc42 100% | 24.3 MiB/s | 398.4 KiB | 00m00s [137/170] Installing rpm-sequoia-0:1.8. 100% | 278.2 MiB/s | 2.5 MiB | 00m00s [138/170] Installing rpm-libs-0:5.99.90 100% | 227.4 MiB/s | 931.4 KiB | 00m00s [139/170] Installing libfsverity-0:1.6- 100% | 32.7 MiB/s | 33.5 KiB | 00m00s [140/170] Installing libevent-0:2.1.12- 100% | 221.4 MiB/s | 906.9 KiB | 00m00s [141/170] Installing openldap-0:2.6.10- 100% | 161.0 MiB/s | 659.6 KiB | 00m00s [142/170] Installing libcurl-0:8.14.1-1 100% | 218.8 MiB/s | 896.3 KiB | 00m00s [143/170] Installing elfutils-debuginfo 100% | 6.0 MiB/s | 86.2 KiB | 00m00s [144/170] Installing elfutils-0:0.193-2 100% | 121.8 MiB/s | 2.9 MiB | 00m00s [145/170] Installing binutils-0:2.44-3. 100% | 235.5 MiB/s | 25.9 MiB | 00m00s [146/170] Installing gdb-minimal-0:16.3 100% | 236.6 MiB/s | 13.2 MiB | 00m00s [147/170] Installing debugedit-0:5.1-6. 100% | 11.9 MiB/s | 195.4 KiB | 00m00s [148/170] Installing curl-0:8.14.1-1.fc 100% | 14.1 MiB/s | 476.8 KiB | 00m00s [149/170] Installing rpm-0:5.99.90-6.fc 100% | 44.8 MiB/s | 2.5 MiB | 00m00s [150/170] Installing efi-srpm-macros-0: 100% | 40.2 MiB/s | 41.1 KiB | 00m00s [151/170] Installing java-srpm-macros-0 100% | 0.0 B/s | 1.1 KiB | 00m00s [152/170] Installing lua-srpm-macros-0: 100% | 1.9 MiB/s | 1.9 KiB | 00m00s [153/170] Installing tree-sitter-srpm-m 100% | 9.0 MiB/s | 9.3 KiB | 00m00s [154/170] Installing zig-srpm-macros-0: 100% | 1.6 MiB/s | 1.7 KiB | 00m00s [155/170] Installing gnupg2-dirmngr-0:2 100% | 20.2 MiB/s | 621.1 KiB | 00m00s [156/170] Installing gnupg2-0:2.4.8-2.f 100% | 163.8 MiB/s | 6.6 MiB | 00m00s [157/170] Installing rpm-sign-libs-0:5. 100% | 39.6 MiB/s | 40.5 KiB | 00m00s [158/170] Installing rpm-build-libs-0:5 100% | 129.5 MiB/s | 265.2 KiB | 00m00s [159/170] Installing gpgverify-0:2.1-3. 100% | 9.2 MiB/s | 9.4 KiB | 00m00s [160/170] Installing rpm-build-0:5.99.9 100% | 15.8 MiB/s | 290.5 KiB | 00m00s [161/170] Installing pyproject-srpm-mac 100% | 2.4 MiB/s | 2.5 KiB | 00m00s [162/170] Installing redhat-rpm-config- 100% | 61.1 MiB/s | 187.8 KiB | 00m00s [163/170] Installing forge-srpm-macros- 100% | 39.3 MiB/s | 40.3 KiB | 00m00s [164/170] Installing fonts-srpm-macros- 100% | 55.7 MiB/s | 57.0 KiB | 00m00s [165/170] Installing go-srpm-macros-0:3 100% | 60.5 MiB/s | 62.0 KiB | 00m00s [166/170] Installing python-srpm-macros 100% | 51.8 MiB/s | 53.1 KiB | 00m00s [167/170] Installing which-0:2.23-1.fc4 100% | 5.6 MiB/s | 85.6 KiB | 00m00s [168/170] Installing util-linux-0:2.40. 100% | 60.8 MiB/s | 3.5 MiB | 00m00s [169/170] Installing shadow-utils-2:4.1 100% | 82.7 MiB/s | 4.1 MiB | 00m00s [170/170] Installing info-0:7.2-3.fc42. 100% | 131.5 KiB/s | 358.3 KiB | 00m03s Warning: skipped OpenPGP checks for 3 packages from repository: copr_base Complete! Finish: installing minimal buildroot with dnf5 Start: creating root cache Finish: creating root cache Finish: chroot init INFO: Installed packages: INFO: add-determinism-0.6.0-1.fc43.x86_64 alternatives-1.33-1.fc43.x86_64 ansible-srpm-macros-1-17.1.fc42.noarch audit-libs-4.0.5-1.fc43.x86_64 bash-5.2.37-3.fc43.x86_64 binutils-2.44-3.fc43.x86_64 build-reproducibility-srpm-macros-0.6.0-1.fc43.noarch bzip2-1.0.8-20.fc42.x86_64 bzip2-libs-1.0.8-20.fc42.x86_64 ca-certificates-2024.2.69_v8.0.401-5.fc42.noarch coreutils-9.7-3.fc43.x86_64 coreutils-common-9.7-3.fc43.x86_64 cpio-2.15-2.fc41.x86_64 crypto-policies-20250603-1.git3a584b3.fc43.noarch curl-8.14.1-1.fc43.x86_64 cyrus-sasl-lib-2.1.28-30.fc42.x86_64 debugedit-5.1-6.fc43.x86_64 diffutils-3.12-2.fc43.x86_64 dwz-0.16-1.fc43.x86_64 ed-1.21-2.fc42.x86_64 efi-srpm-macros-6-3.fc43.noarch elfutils-0.193-2.fc43.x86_64 elfutils-debuginfod-client-0.193-2.fc43.x86_64 elfutils-default-yama-scope-0.193-2.fc43.noarch elfutils-libelf-0.193-2.fc43.x86_64 elfutils-libs-0.193-2.fc43.x86_64 fedora-gpg-keys-43-0.2.noarch fedora-release-43-0.16.noarch fedora-release-common-43-0.16.noarch fedora-release-identity-basic-43-0.16.noarch fedora-repos-43-0.2.noarch fedora-repos-rawhide-43-0.2.noarch file-5.46-5.fc43.x86_64 file-libs-5.46-5.fc43.x86_64 filesystem-3.18-44.fc43.x86_64 filesystem-srpm-macros-3.18-44.fc43.noarch findutils-4.10.0-5.fc42.x86_64 fonts-srpm-macros-2.0.5-22.fc43.noarch forge-srpm-macros-0.4.0-2.fc42.noarch fpc-srpm-macros-1.3-14.fc42.noarch gawk-5.3.2-1.fc43.x86_64 gdb-minimal-16.3-3.fc43.x86_64 gdbm-libs-1.23-9.fc42.x86_64 ghc-srpm-macros-1.9.2-2.fc42.noarch glibc-2.41.9000-15.fc43.x86_64 glibc-common-2.41.9000-15.fc43.x86_64 glibc-gconv-extra-2.41.9000-15.fc43.x86_64 glibc-minimal-langpack-2.41.9000-15.fc43.x86_64 gmp-6.3.0-3.fc43.x86_64 gnat-srpm-macros-6-7.fc42.noarch gnupg2-2.4.8-2.fc43.x86_64 gnupg2-dirmngr-2.4.8-2.fc43.x86_64 gnupg2-gpg-agent-2.4.8-2.fc43.x86_64 gnupg2-gpgconf-2.4.8-2.fc43.x86_64 gnupg2-keyboxd-2.4.8-2.fc43.x86_64 gnupg2-verify-2.4.8-2.fc43.x86_64 gnutls-3.8.9-5.fc43.x86_64 go-srpm-macros-3.6.0-7.fc43.noarch gpg-pubkey-36f612dcf27f7d1a48a835e4dbfcf71c6d9f90a6-6786af3b gpg-pubkey-b0f4950458f69e1150c6c5edc8ac4916105ef944-65ca83d1 gpg-pubkey-c6e7f081cf80e13146676e88829b606631645531-66b6dccf gpgverify-2.1-3.fc43.noarch grep-3.12-1.fc43.x86_64 gzip-1.13-3.fc42.x86_64 ima-evm-utils-libs-1.6.2-5.fc43.x86_64 info-7.2-3.fc42.x86_64 jansson-2.14-2.fc42.x86_64 java-srpm-macros-1-4.fc43.noarch json-c-0.18-2.fc42.x86_64 kernel-srpm-macros-1.0-25.fc42.noarch keyutils-libs-1.6.3-5.fc42.x86_64 krb5-libs-1.21.3-6.fc43.x86_64 libacl-2.3.2-3.fc42.x86_64 libarchive-3.8.1-1.fc43.x86_64 libassuan-2.5.7-3.fc42.x86_64 libattr-2.5.2-5.fc42.x86_64 libblkid-2.40.4-8.fc43.x86_64 libbrotli-1.1.0-7.fc43.x86_64 libcap-2.76-1.fc43.x86_64 libcap-ng-0.8.5-5.fc43.x86_64 libcom_err-1.47.2-3.fc42.x86_64 libcurl-8.14.1-1.fc43.x86_64 libeconf-0.7.9-1.fc43.x86_64 libevent-2.1.12-15.fc42.x86_64 libfdisk-2.40.4-8.fc43.x86_64 libffi-3.5.1-1.fc43.x86_64 libfsverity-1.6-2.fc42.x86_64 libgcc-15.1.1-2.fc43.x86_64 libgcrypt-1.11.1-1.fc43.x86_64 libgomp-15.1.1-2.fc43.x86_64 libgpg-error-1.55-1.fc43.x86_64 libidn2-2.3.8-1.fc43.x86_64 libksba-1.6.7-3.fc42.x86_64 libmount-2.40.4-8.fc43.x86_64 libnghttp2-1.65.0-1.fc43.x86_64 libpkgconf-2.3.0-2.fc42.x86_64 libpsl-0.21.5-5.fc42.x86_64 libselinux-3.8-3.fc43.x86_64 libsemanage-3.8.1-3.fc43.x86_64 libsepol-3.8-1.fc42.x86_64 libsmartcols-2.40.4-8.fc43.x86_64 libssh-0.11.1-4.fc42.x86_64 libssh-config-0.11.1-4.fc42.noarch libstdc++-15.1.1-2.fc43.x86_64 libtasn1-4.20.0-1.fc43.x86_64 libtool-ltdl-2.5.4-4.fc42.x86_64 libunistring-1.1-9.fc42.x86_64 libusb1-1.0.28-2.fc43.x86_64 libuuid-2.40.4-8.fc43.x86_64 libverto-0.3.2-10.fc42.x86_64 libxcrypt-4.4.38-7.fc43.x86_64 libxml2-2.12.10-2.fc43.x86_64 libzstd-1.5.7-1.fc43.x86_64 lua-libs-5.4.8-1.fc43.x86_64 lua-srpm-macros-1-15.fc42.noarch lz4-libs-1.10.0-2.fc42.x86_64 mpfr-4.2.2-1.fc43.x86_64 ncurses-base-6.5-5.20250125.fc42.noarch ncurses-libs-6.5-5.20250125.fc42.x86_64 nettle-3.10.1-1.fc43.x86_64 npth-1.8-2.fc42.x86_64 ocaml-srpm-macros-10-4.fc42.noarch openblas-srpm-macros-2-19.fc42.noarch openldap-2.6.10-1.fc43.x86_64 openssl-libs-3.5.0-5.fc43.x86_64 p11-kit-0.25.5-8.fc43.x86_64 p11-kit-trust-0.25.5-8.fc43.x86_64 package-notes-srpm-macros-0.5-13.fc42.noarch pam-libs-1.7.0-4.fc42.x86_64 patch-2.8-1.fc43.x86_64 pcre2-10.45-1.fc43.x86_64 pcre2-syntax-10.45-1.fc43.noarch perl-srpm-macros-1-57.fc42.noarch pkgconf-2.3.0-2.fc42.x86_64 pkgconf-m4-2.3.0-2.fc42.noarch pkgconf-pkg-config-2.3.0-2.fc42.x86_64 popt-1.19-8.fc42.x86_64 publicsuffix-list-dafsa-20250116-1.fc42.noarch pyproject-srpm-macros-1.18.1-1.fc43.noarch python-srpm-macros-3.14-1.fc43.noarch qt5-srpm-macros-5.15.17-1.fc43.noarch qt6-srpm-macros-6.9.1-1.fc43.noarch readline-8.2-13.fc43.x86_64 redhat-rpm-config-343-6.fc43.noarch rpm-5.99.90-6.fc43.x86_64 rpm-build-5.99.90-6.fc43.x86_64 rpm-build-libs-5.99.90-6.fc43.x86_64 rpm-libs-5.99.90-6.fc43.x86_64 rpm-sequoia-1.8.0-1.fc43.x86_64 rpm-sign-libs-5.99.90-6.fc43.x86_64 rust-srpm-macros-26.3-4.fc42.noarch sed-4.9-4.fc42.x86_64 setup-2.15.0-25.fc43.noarch shadow-utils-4.17.4-1.fc43.x86_64 sqlite-libs-3.50.0-1.fc43.x86_64 systemd-libs-257.6-1.fc43.x86_64 systemd-standalone-sysusers-257.6-1.fc43.x86_64 tar-1.35-5.fc42.x86_64 tpm2-tss-4.1.3-7.fc43.x86_64 tree-sitter-srpm-macros-0.4.1-1.fc43.noarch unzip-6.0-66.fc42.x86_64 util-linux-2.40.4-8.fc43.x86_64 util-linux-core-2.40.4-8.fc43.x86_64 which-2.23-1.fc42.x86_64 xxhash-libs-0.8.3-2.fc42.x86_64 xz-5.8.1-1.fc43.x86_64 xz-libs-5.8.1-1.fc43.x86_64 zig-srpm-macros-1-4.fc42.noarch zip-3.0-43.fc42.x86_64 zlib-ng-compat-2.2.4-2.fc43.x86_64 zstd-1.5.7-1.fc43.x86_64 Start: buildsrpm Start: rpmbuild -bs Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1750118400 Wrote: /builddir/build/SRPMS/rccl-6.4.1-3.fc43.src.rpm Finish: rpmbuild -bs INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-rawhide-x86_64-1750253281.853958/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names Finish: buildsrpm INFO: Done(/var/lib/copr-rpmbuild/workspace/workdir-mg18jaca/rccl/rccl.spec) Config(child) 0 minutes 36 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot INFO: Start(/var/lib/copr-rpmbuild/results/rccl-6.4.1-3.fc43.src.rpm) Config(fedora-rawhide-x86_64) Start(bootstrap): chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1750253281.853958/root. INFO: reusing tmpfs at /var/lib/mock/fedora-rawhide-x86_64-bootstrap-1750253281.853958/root. INFO: calling preinit hooks INFO: enabled root cache INFO: enabled package manager cache Start(bootstrap): cleaning package manager metadata Finish(bootstrap): cleaning package manager metadata Finish(bootstrap): chroot init Start: chroot init INFO: mounting tmpfs at /var/lib/mock/fedora-rawhide-x86_64-1750253281.853958/root. INFO: calling preinit hooks INFO: enabled root cache Start: unpacking root cache Finish: unpacking root cache INFO: enabled package manager cache Start: cleaning package manager metadata Finish: cleaning package manager metadata INFO: enabled HW Info plugin INFO: Buildroot is handled by package management downloaded with a bootstrap image: rpm-5.99.90-6.fc43.x86_64 rpm-sequoia-1.8.0-1.fc43.x86_64 dnf5-5.2.13.1-3.fc43.x86_64 dnf5-plugins-5.2.13.1-3.fc43.x86_64 Finish: chroot init Start: build phase for rccl-6.4.1-3.fc43.src.rpm Start: build setup for rccl-6.4.1-3.fc43.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1750118400 Wrote: /builddir/build/SRPMS/rccl-6.4.1-3.fc43.src.rpm Updating and loading repositories: fedora 100% | 738.8 KiB/s | 27.3 KiB | 00m00s Copr repository 100% | 39.3 KiB/s | 1.5 KiB | 00m00s Repositories loaded. Package Arch Version Repository Size Installing: cmake x86_64 3.31.6-3.fc43 fedora 34.5 MiB gcc-c++ x86_64 15.1.1-2.fc43 copr_base 41.3 MiB hipify x86_64 6.4.1-2.fc43 copr_base 3.1 MiB rocm-cmake noarch 6.4.0-1.fc43 copr_base 130.5 KiB rocm-comgr-devel x86_64 19-10.rocm6.4.1.fc43 copr_base 98.2 KiB rocm-core-devel x86_64 6.4.1-1.fc43 copr_base 14.8 KiB rocm-hip-devel x86_64 6.4.1-2.fc43 fedora 2.8 MiB rocm-rpm-macros noarch 6.4.0-4.fc43 fedora 18.9 KiB rocm-runtime-devel x86_64 6.4.1-1.fc43 copr_base 571.3 KiB rocm-smi-devel x86_64 6.4.1-1.fc43 copr_base 281.8 KiB Installing dependencies: annobin-docs noarch 12.96-1.fc43 fedora 98.9 KiB annobin-plugin-gcc x86_64 12.96-1.fc43 fedora 993.6 KiB cmake-data noarch 3.31.6-3.fc43 fedora 8.5 MiB cmake-filesystem x86_64 3.31.6-3.fc43 fedora 0.0 B cmake-rpm-macros noarch 3.31.6-3.fc43 fedora 7.7 KiB cpp x86_64 15.1.1-2.fc43 copr_base 37.9 MiB emacs-filesystem noarch 1:30.0-4.fc42 fedora 0.0 B environment-modules x86_64 5.5.0-3.fc42 fedora 1.8 MiB expat x86_64 2.7.1-1.fc43 fedora 294.2 KiB gcc x86_64 15.1.1-2.fc43 copr_base 111.1 MiB gcc-plugin-annobin x86_64 15.1.1-2.fc43 copr_base 57.2 KiB git x86_64 2.49.0-2.fc43 fedora 85.3 KiB git-core x86_64 2.49.0-2.fc43 fedora 22.8 MiB git-core-doc noarch 2.49.0-2.fc43 fedora 17.6 MiB glibc-devel x86_64 2.41.9000-15.fc43 fedora 2.3 MiB groff-base x86_64 1.23.0-8.fc42 fedora 3.9 MiB hipcc x86_64 19-10.rocm6.4.1.fc43 copr_base 652.9 KiB hwdata noarch 0.396-1.fc43 fedora 9.5 MiB jsoncpp x86_64 1.9.6-1.fc43 fedora 261.6 KiB kernel-headers x86_64 6.16.0-0.rc1.17.fc43 fedora 6.7 MiB less x86_64 678-1.fc43 fedora 405.8 KiB libcbor x86_64 0.11.0-3.fc42 fedora 77.8 KiB libdb x86_64 5.3.28-65.fc43 fedora 1.9 MiB libdrm x86_64 2.4.124-2.fc42 fedora 407.9 KiB libdrm-devel x86_64 2.4.124-2.fc42 fedora 708.5 KiB libedit x86_64 3.1-55.20250104cvs.fc42 fedora 244.1 KiB libfido2 x86_64 1.15.0-3.fc42 fedora 242.1 KiB libmpc x86_64 1.3.1-7.fc42 fedora 164.5 KiB libpciaccess x86_64 0.16-15.fc42 fedora 44.5 KiB libpciaccess-devel x86_64 0.16-15.fc42 fedora 15.3 KiB libpipeline x86_64 1.5.8-2.fc42 fedora 145.1 KiB libstdc++-devel x86_64 15.1.1-2.fc43 copr_base 16.1 MiB libtommath x86_64 1.3.1~rc1-5.fc42 fedora 130.4 KiB libuv x86_64 1:1.51.0-1.fc43 fedora 570.2 KiB libxcrypt-devel x86_64 4.4.38-7.fc43 fedora 30.8 KiB make x86_64 1:4.4.1-10.fc42 fedora 1.8 MiB man-db x86_64 2.13.1-1.fc43 fedora 2.9 MiB mpdecimal x86_64 4.0.1-1.fc43 fedora 217.2 KiB ncurses x86_64 6.5-5.20250125.fc42 fedora 608.1 KiB numactl-libs x86_64 2.0.19-2.fc42 fedora 52.9 KiB openssh x86_64 10.0p1-3.fc43 fedora 1.4 MiB openssh-clients x86_64 10.0p1-3.fc43 fedora 2.6 MiB perl x86_64 4:5.40.2-517.fc43 fedora 0.0 B perl-Algorithm-Diff noarch 1.2010-13.fc42 fedora 107.5 KiB perl-Archive-Tar noarch 3.04-1.fc43 fedora 154.4 KiB perl-Archive-Zip noarch 1.68-16.fc42 fedora 291.1 KiB perl-Attribute-Handlers noarch 1.03-517.fc43 fedora 39.9 KiB perl-AutoLoader noarch 5.74-517.fc43 fedora 20.5 KiB perl-AutoSplit noarch 5.74-517.fc43 fedora 23.1 KiB perl-B x86_64 1.89-517.fc43 fedora 498.0 KiB perl-Benchmark noarch 1.25-517.fc43 fedora 36.3 KiB perl-CPAN noarch 2.38-4.fc43 fedora 1.9 MiB perl-CPAN-Meta noarch 2.150010-512.fc42 fedora 592.2 KiB perl-CPAN-Meta-Requirements noarch 2.143-10.fc42 fedora 81.2 KiB perl-CPAN-Meta-YAML noarch 0.020-2.fc42 fedora 52.1 KiB perl-Carp noarch 1.54-512.fc42 fedora 46.6 KiB perl-Class-Struct noarch 0.68-517.fc43 fedora 25.4 KiB perl-Compress-Bzip2 x86_64 2.28-21.fc42 fedora 142.6 KiB perl-Compress-Raw-Bzip2 x86_64 2.213-2.fc42 fedora 67.3 KiB perl-Compress-Raw-Lzma x86_64 2.213-5.fc42 fedora 120.9 KiB perl-Compress-Raw-Zlib x86_64 2.213-2.fc42 fedora 163.2 KiB perl-Config-Extensions noarch 0.03-517.fc43 fedora 2.6 KiB perl-Config-Perl-V noarch 0.38-2.fc42 fedora 25.9 KiB perl-DBM_Filter noarch 0.06-517.fc43 fedora 28.5 KiB perl-DB_File x86_64 1.859-513.fc42 fedora 188.8 KiB perl-Data-Dumper x86_64 2.189-513.fc42 fedora 115.6 KiB perl-Data-OptList noarch 0.114-6.fc42 fedora 50.1 KiB perl-Data-Section noarch 0.200008-7.fc42 fedora 42.7 KiB perl-Devel-PPPort x86_64 3.72-513.fc42 fedora 892.1 KiB perl-Devel-Peek x86_64 1.34-517.fc43 fedora 43.5 KiB perl-Devel-SelfStubber noarch 1.06-517.fc43 fedora 6.7 KiB perl-Devel-Size x86_64 0.85-1.fc43 fedora 42.0 KiB perl-Digest noarch 1.20-512.fc42 fedora 35.3 KiB perl-Digest-MD5 x86_64 2.59-6.fc42 fedora 59.7 KiB perl-Digest-SHA x86_64 1:6.04-513.fc42 fedora 112.5 KiB perl-DirHandle noarch 1.05-517.fc43 fedora 3.4 KiB perl-Dumpvalue noarch 2.27-517.fc43 fedora 19.8 KiB perl-DynaLoader x86_64 1.56-517.fc43 fedora 32.1 KiB perl-Encode x86_64 4:3.21-512.fc42 fedora 4.7 MiB perl-Encode-devel x86_64 4:3.21-512.fc42 fedora 99.6 KiB perl-English noarch 1.11-517.fc43 fedora 6.2 KiB perl-Env noarch 1.06-512.fc42 fedora 26.1 KiB perl-Errno x86_64 1.38-517.fc43 fedora 8.3 KiB perl-Error noarch 1:0.17030-1.fc43 fedora 76.7 KiB perl-Exporter noarch 5.78-512.fc42 fedora 54.3 KiB perl-ExtUtils-CBuilder noarch 1:0.280240-512.fc42 fedora 96.9 KiB perl-ExtUtils-Command noarch 2:7.76-1.fc43 fedora 9.6 KiB perl-ExtUtils-Constant noarch 0.25-517.fc43 fedora 85.8 KiB perl-ExtUtils-Embed noarch 1.35-517.fc43 fedora 15.5 KiB perl-ExtUtils-Install noarch 2.22-512.fc42 fedora 85.5 KiB perl-ExtUtils-MM-Utils noarch 2:7.76-1.fc43 fedora 2.9 KiB perl-ExtUtils-MakeMaker noarch 2:7.76-1.fc43 fedora 739.6 KiB perl-ExtUtils-Manifest noarch 1:1.75-512.fc42 fedora 84.8 KiB perl-ExtUtils-Miniperl noarch 1.14-517.fc43 fedora 8.2 KiB perl-ExtUtils-ParseXS noarch 1:3.57-1.fc43 fedora 483.2 KiB perl-Fcntl x86_64 1.18-517.fc43 fedora 48.9 KiB perl-File-Basename noarch 2.86-517.fc43 fedora 14.0 KiB perl-File-Compare noarch 1.100.800-517.fc43 fedora 5.6 KiB perl-File-Copy noarch 2.41-517.fc43 fedora 19.6 KiB perl-File-DosGlob x86_64 1.12-517.fc43 fedora 20.8 KiB perl-File-Fetch noarch 1.08-1.fc43 fedora 60.3 KiB perl-File-Find noarch 1.44-517.fc43 fedora 41.9 KiB perl-File-HomeDir noarch 1.006-14.fc42 fedora 119.3 KiB perl-File-Path noarch 2.18-512.fc42 fedora 63.5 KiB perl-File-Temp noarch 1:0.231.100-512.fc42 fedora 162.3 KiB perl-File-Which noarch 1.27-13.fc42 fedora 30.4 KiB perl-File-stat noarch 1.14-517.fc43 fedora 12.5 KiB perl-FileCache noarch 1.10-517.fc43 fedora 7.4 KiB perl-FileHandle noarch 2.05-517.fc43 fedora 9.3 KiB perl-Filter x86_64 2:1.64-513.fc42 fedora 156.7 KiB perl-Filter-Simple noarch 0.96-512.fc42 fedora 50.7 KiB perl-FindBin noarch 1.54-517.fc43 fedora 6.7 KiB perl-GDBM_File x86_64 1:1.24-517.fc43 fedora 79.6 KiB perl-Getopt-Long noarch 1:2.58-3.fc42 fedora 144.5 KiB perl-Getopt-Std noarch 1.14-517.fc43 fedora 11.2 KiB perl-Git noarch 2.49.0-2.fc43 fedora 64.0 KiB perl-HTTP-Tiny noarch 0.090-2.fc42 fedora 154.4 KiB perl-Hash-Util x86_64 0.32-517.fc43 fedora 55.0 KiB perl-Hash-Util-FieldHash x86_64 1.27-517.fc43 fedora 62.5 KiB perl-I18N-Collate noarch 1.02-517.fc43 fedora 7.1 KiB perl-I18N-LangTags noarch 0.45-517.fc43 fedora 82.3 KiB perl-I18N-Langinfo x86_64 0.24-517.fc43 fedora 34.7 KiB perl-IO x86_64 1.55-517.fc43 fedora 147.0 KiB perl-IO-Compress noarch 2.213-3.fc42 fedora 1.0 MiB perl-IO-Compress-Lzma noarch 2.213-2.fc42 fedora 215.2 KiB perl-IO-Socket-IP noarch 0.43-2.fc42 fedora 100.3 KiB perl-IO-Socket-SSL noarch 2.091-1.fc43 fedora 711.4 KiB perl-IO-Zlib noarch 1:1.15-512.fc42 fedora 25.7 KiB perl-IPC-Cmd noarch 2:1.04-513.fc42 fedora 84.9 KiB perl-IPC-Open3 noarch 1.22-517.fc43 fedora 22.5 KiB perl-IPC-SysV x86_64 2.09-513.fc42 fedora 73.7 KiB perl-IPC-System-Simple noarch 1.30-15.fc42 fedora 71.7 KiB perl-JSON-PP noarch 1:4.16-513.fc42 fedora 141.8 KiB perl-Locale-Maketext noarch 1.33-513.fc42 fedora 171.3 KiB perl-Locale-Maketext-Simple noarch 1:0.21-517.fc43 fedora 12.8 KiB perl-MIME-Base32 noarch 1.303-23.fc42 fedora 30.7 KiB perl-MIME-Base64 x86_64 3.16-512.fc42 fedora 42.0 KiB perl-MRO-Compat noarch 0.15-11.fc42 fedora 43.0 KiB perl-Math-BigInt noarch 1:2.0050.03-1.fc43 fedora 1.1 MiB perl-Math-BigInt-FastCalc x86_64 0.502.000-1.fc43 fedora 44.0 KiB perl-Math-Complex noarch 1.62-517.fc43 fedora 85.0 KiB perl-Memoize noarch 1.16-517.fc43 fedora 64.5 KiB perl-Module-Build noarch 2:0.42.34-8.fc42 fedora 654.2 KiB perl-Module-CoreList noarch 1:5.20250528-1.fc43 fedora 1.2 MiB perl-Module-CoreList-tools noarch 1:5.20250528-1.fc43 fedora 18.6 KiB perl-Module-Load noarch 1:0.36-512.fc42 fedora 14.9 KiB perl-Module-Load-Conditional noarch 0.74-512.fc42 fedora 28.7 KiB perl-Module-Loaded noarch 1:0.08-517.fc43 fedora 5.0 KiB perl-Module-Metadata noarch 1.000038-512.fc42 fedora 67.5 KiB perl-Module-Signature noarch 0.90-1.fc43 fedora 139.6 KiB perl-NDBM_File x86_64 1.17-517.fc43 fedora 28.4 KiB perl-NEXT noarch 0.69-517.fc43 fedora 23.5 KiB perl-Net noarch 1.04-517.fc43 fedora 22.3 KiB perl-Net-Ping noarch 2.76-512.fc42 fedora 134.2 KiB perl-Net-SSLeay x86_64 1.94-9.fc43 fedora 1.3 MiB perl-ODBM_File x86_64 1.18-517.fc43 fedora 28.3 KiB perl-Opcode x86_64 1.65-517.fc43 fedora 48.5 KiB perl-POSIX x86_64 2.20-517.fc43 fedora 231.0 KiB perl-Package-Generator noarch 1.106-33.fc42 fedora 29.9 KiB perl-Params-Check noarch 1:0.38-512.fc42 fedora 27.6 KiB perl-Params-Util x86_64 1.102-17.fc42 fedora 58.5 KiB perl-PathTools x86_64 3.91-513.fc42 fedora 180.0 KiB perl-Perl-OSType noarch 1.010-513.fc42 fedora 32.8 KiB perl-PerlIO-via-QuotedPrint noarch 0.10-512.fc42 fedora 30.2 KiB perl-Pod-Checker noarch 4:1.77-512.fc42 fedora 52.2 KiB perl-Pod-Escapes noarch 1:1.07-512.fc42 fedora 24.9 KiB perl-Pod-Functions noarch 1.14-517.fc43 fedora 14.2 KiB perl-Pod-Html noarch 1.35-517.fc43 fedora 42.2 KiB perl-Pod-Perldoc noarch 3.28.01-513.fc42 fedora 163.7 KiB perl-Pod-Simple noarch 1:3.47-1.fc43 fedora 565.2 KiB perl-Pod-Usage noarch 4:2.05-1.fc43 fedora 86.3 KiB perl-Safe noarch 2.46-517.fc43 fedora 30.6 KiB perl-Scalar-List-Utils x86_64 5:1.69-1.fc43 fedora 144.8 KiB perl-Search-Dict noarch 1.07-517.fc43 fedora 4.7 KiB perl-SelectSaver noarch 1.02-517.fc43 fedora 2.2 KiB perl-SelfLoader noarch 1.27-517.fc43 fedora 22.4 KiB perl-Socket x86_64 4:2.038-512.fc42 fedora 119.9 KiB perl-Software-License noarch 0.104007-1.fc43 fedora 500.7 KiB perl-Storable x86_64 1:3.32-512.fc42 fedora 232.3 KiB perl-Sub-Exporter noarch 0.991-5.fc42 fedora 194.9 KiB perl-Sub-Install noarch 0.929-7.fc42 fedora 35.9 KiB perl-Symbol noarch 1.09-517.fc43 fedora 6.8 KiB perl-Sys-Hostname x86_64 1.25-517.fc43 fedora 15.8 KiB perl-Sys-Syslog x86_64 0.36-513.fc42 fedora 94.7 KiB perl-Term-ANSIColor noarch 5.01-513.fc42 fedora 97.5 KiB perl-Term-Cap noarch 1.18-512.fc42 fedora 29.3 KiB perl-Term-Complete noarch 1.403-517.fc43 fedora 5.7 KiB perl-Term-ReadLine noarch 1.17-517.fc43 fedora 17.3 KiB perl-Term-Table noarch 0.024-2.fc42 fedora 77.9 KiB perl-TermReadKey x86_64 2.38-24.fc42 fedora 64.0 KiB perl-Test noarch 1.31-517.fc43 fedora 37.0 KiB perl-Test-Harness noarch 1:3.52-1.fc43 fedora 560.6 KiB perl-Test-Simple noarch 3:1.302214-1.fc43 fedora 1.7 MiB perl-Text-Abbrev noarch 1.02-517.fc43 fedora 3.1 KiB perl-Text-Balanced noarch 2.06-512.fc42 fedora 111.4 KiB perl-Text-Diff noarch 1.45-23.fc42 fedora 83.0 KiB perl-Text-Glob noarch 0.11-25.fc42 fedora 8.4 KiB perl-Text-ParseWords noarch 3.31-512.fc42 fedora 13.6 KiB perl-Text-Tabs+Wrap noarch 2024.001-512.fc42 fedora 22.6 KiB perl-Text-Template noarch 1.61-7.fc42 fedora 112.4 KiB perl-Thread noarch 3.05-517.fc43 fedora 12.1 KiB perl-Thread-Queue noarch 3.14-512.fc42 fedora 28.9 KiB perl-Thread-Semaphore noarch 2.13-517.fc43 fedora 10.0 KiB perl-Tie noarch 4.6-517.fc43 fedora 32.0 KiB perl-Tie-File noarch 1.09-517.fc43 fedora 85.7 KiB perl-Tie-Memoize noarch 1.1-517.fc43 fedora 6.2 KiB perl-Tie-RefHash noarch 1.41-2.fc42 fedora 35.9 KiB perl-Time noarch 1.04-517.fc43 fedora 9.7 KiB perl-Time-HiRes x86_64 4:1.9777-512.fc42 fedora 115.8 KiB perl-Time-Local noarch 2:1.350-512.fc42 fedora 68.9 KiB perl-Time-Piece x86_64 1.3401-517.fc43 fedora 71.0 KiB perl-URI noarch 5.32-1.fc43 fedora 261.2 KiB perl-Unicode-Collate x86_64 1.31-512.fc42 fedora 4.2 MiB perl-Unicode-Normalize x86_64 1.32-512.fc42 fedora 465.1 KiB perl-Unicode-UCD noarch 0.78-517.fc43 fedora 204.4 KiB perl-User-pwent noarch 1.05-517.fc43 fedora 17.0 KiB perl-autodie noarch 2.37-513.fc42 fedora 214.9 KiB perl-autouse noarch 1.11-517.fc43 fedora 5.9 KiB perl-base noarch 2.27-517.fc43 fedora 12.5 KiB perl-bignum noarch 0.67-513.fc42 fedora 133.1 KiB perl-blib noarch 1.07-517.fc43 fedora 3.2 KiB perl-constant noarch 1.33-513.fc42 fedora 26.2 KiB perl-debugger noarch 1.60-517.fc43 fedora 402.2 KiB perl-deprecate noarch 0.04-517.fc43 fedora 6.5 KiB perl-devel x86_64 4:5.40.2-517.fc43 fedora 8.0 MiB perl-diagnostics noarch 1.40-517.fc43 fedora 465.4 KiB perl-doc noarch 5.40.2-517.fc43 fedora 11.0 MiB perl-encoding x86_64 4:3.00-512.fc42 fedora 149.5 KiB perl-encoding-warnings noarch 0.14-517.fc43 fedora 10.1 KiB perl-experimental noarch 0.035-1.fc43 fedora 41.5 KiB perl-fields noarch 2.27-517.fc43 fedora 11.8 KiB perl-filetest noarch 1.03-517.fc43 fedora 6.4 KiB perl-if noarch 0.61.000-517.fc43 fedora 5.8 KiB perl-inc-latest noarch 2:0.500-30.fc42 fedora 34.6 KiB perl-interpreter x86_64 4:5.40.2-517.fc43 fedora 118.3 KiB perl-less noarch 0.03-517.fc43 fedora 4.9 KiB perl-lib x86_64 0.65-517.fc43 fedora 8.5 KiB perl-libnet noarch 3.15-513.fc42 fedora 289.4 KiB perl-libnetcfg noarch 4:5.40.2-517.fc43 fedora 16.9 KiB perl-libs x86_64 4:5.40.2-517.fc43 fedora 9.8 MiB perl-local-lib noarch 2.000029-9.fc42 fedora 117.6 KiB perl-locale noarch 1.12-517.fc43 fedora 6.5 KiB perl-macros noarch 4:5.40.2-517.fc43 fedora 5.5 KiB perl-meta-notation noarch 5.40.2-517.fc43 fedora 2.0 KiB perl-mro x86_64 1.29-517.fc43 fedora 41.5 KiB perl-open noarch 1.13-517.fc43 fedora 11.3 KiB perl-overload noarch 1.37-517.fc43 fedora 71.5 KiB perl-overloading noarch 0.02-517.fc43 fedora 4.8 KiB perl-parent noarch 1:0.244-2.fc42 fedora 10.3 KiB perl-perlfaq noarch 5.20240218-512.fc42 fedora 732.6 KiB perl-ph x86_64 5.40.2-517.fc43 fedora 271.3 KiB perl-podlators noarch 1:6.0.2-3.fc42 fedora 317.5 KiB perl-sigtrap noarch 1.10-517.fc43 fedora 11.0 KiB perl-sort noarch 2.05-517.fc43 fedora 4.8 KiB perl-subs noarch 1.04-517.fc43 fedora 2.1 KiB perl-threads x86_64 1:2.40-512.fc42 fedora 115.0 KiB perl-threads-shared x86_64 1.69-512.fc42 fedora 83.6 KiB perl-utils noarch 5.40.2-517.fc43 fedora 96.8 KiB perl-vars noarch 1.05-517.fc43 fedora 3.9 KiB perl-version x86_64 9:0.99.33-2.fc42 fedora 128.7 KiB perl-vmsish noarch 1.04-517.fc43 fedora 6.5 KiB procps-ng x86_64 4.0.4-6.fc42 fedora 1.0 MiB python-pip-wheel noarch 25.1.1-4.fc43 fedora 1.2 MiB python3 x86_64 3.14.0~b2-3.fc43 fedora 28.9 KiB python3-libs x86_64 3.14.0~b2-3.fc43 fedora 42.7 MiB python3-pyparsing noarch 3.1.2-10.fc43 fedora 1.0 MiB rhash x86_64 1.4.5-2.fc42 fedora 351.0 KiB rocm-clang x86_64 19-10.rocm6.4.1.fc43 copr_base 70.2 MiB rocm-clang-devel x86_64 19-10.rocm6.4.1.fc43 copr_base 23.3 MiB rocm-clang-libs x86_64 19-10.rocm6.4.1.fc43 copr_base 98.4 MiB rocm-clang-runtime-devel x86_64 19-10.rocm6.4.1.fc43 copr_base 6.9 MiB rocm-comgr x86_64 19-10.rocm6.4.1.fc43 copr_base 123.9 MiB rocm-core x86_64 6.4.1-1.fc43 copr_base 12.3 KiB rocm-device-libs x86_64 19-10.rocm6.4.1.fc43 copr_base 3.2 MiB rocm-hip x86_64 6.4.1-2.fc43 fedora 24.9 MiB rocm-libc++ x86_64 19-10.rocm6.4.1.fc43 copr_base 1.2 MiB rocm-libc++-devel x86_64 19-10.rocm6.4.1.fc43 copr_base 7.5 MiB rocm-lld x86_64 19-10.rocm6.4.1.fc43 copr_base 5.7 MiB rocm-llvm x86_64 19-10.rocm6.4.1.fc43 copr_base 48.4 MiB rocm-llvm-devel x86_64 19-10.rocm6.4.1.fc43 copr_base 25.3 MiB rocm-llvm-filesystem x86_64 19-10.rocm6.4.1.fc43 copr_base 0.0 B rocm-llvm-libs x86_64 19-10.rocm6.4.1.fc43 copr_base 84.7 MiB rocm-llvm-static x86_64 19-10.rocm6.4.1.fc43 copr_base 250.2 MiB rocm-runtime x86_64 6.4.1-1.fc43 copr_base 3.1 MiB rocm-smi x86_64 6.4.1-1.fc43 copr_base 2.7 MiB systemtap-sdt-devel x86_64 5.3-2.fc43 fedora 182.9 KiB systemtap-sdt-dtrace x86_64 5.3-2.fc43 fedora 179.6 KiB tcl x86_64 1:9.0.0-8.fc43 fedora 4.3 MiB tzdata noarch 2025b-1.fc43 fedora 1.6 MiB vim-filesystem noarch 2:9.1.1435-1.fc43 fedora 40.0 B zlib-ng-compat-devel x86_64 2.2.4-2.fc43 fedora 107.0 KiB Transaction Summary: Installing: 301 packages Total size of inbound packages is 295 MiB. Need to download 295 MiB. After this operation, 1 GiB extra will be used (install 1 GiB, remove 0 B). [ 1/301] gcc-c++-0:15.1.1-2.fc43.x86_6 100% | 53.4 MiB/s | 15.2 MiB | 00m00s [ 2/301] rocm-cmake-0:6.4.0-1.fc43.noa 100% | 3.1 MiB/s | 37.6 KiB | 00m00s [ 3/301] rocm-comgr-devel-0:19-10.rocm 100% | 2.4 MiB/s | 32.0 KiB | 00m00s [ 4/301] rocm-core-devel-0:6.4.1-1.fc4 100% | 109.8 KiB/s | 13.4 KiB | 00m00s [ 5/301] hipify-0:6.4.1-2.fc43.x86_64 100% | 745.5 KiB/s | 505.5 KiB | 00m01s [ 6/301] rocm-rpm-macros-0:6.4.0-4.fc4 100% | 71.2 KiB/s | 15.9 KiB | 00m00s [ 7/301] rocm-runtime-devel-0:6.4.1-1. 100% | 4.6 MiB/s | 93.7 KiB | 00m00s [ 8/301] rocm-smi-devel-0:6.4.1-1.fc43 100% | 5.1 MiB/s | 57.3 KiB | 00m00s [ 9/301] rocm-hip-devel-0:6.4.1-2.fc43 100% | 461.4 KiB/s | 247.8 KiB | 00m01s [ 10/301] cmake-filesystem-0:3.31.6-3.f 100% | 225.1 KiB/s | 16.4 KiB | 00m00s [ 11/301] expat-0:2.7.1-1.fc43.x86_64 100% | 794.0 KiB/s | 115.9 KiB | 00m00s [ 12/301] jsoncpp-0:1.9.6-1.fc43.x86_64 100% | 639.0 KiB/s | 101.6 KiB | 00m00s [ 13/301] libuv-1:1.51.0-1.fc43.x86_64 100% | 918.5 KiB/s | 266.4 KiB | 00m00s [ 14/301] make-1:4.4.1-10.fc42.x86_64 100% | 1.1 MiB/s | 587.0 KiB | 00m01s [ 15/301] rhash-0:1.4.5-2.fc42.x86_64 100% | 1.2 MiB/s | 198.7 KiB | 00m00s [ 16/301] libmpc-0:1.3.1-7.fc42.x86_64 100% | 971.3 KiB/s | 70.9 KiB | 00m00s [ 17/301] perl-4:5.40.2-517.fc43.x86_64 100% | 169.7 KiB/s | 13.9 KiB | 00m00s [ 18/301] perl-interpreter-4:5.40.2-517 100% | 991.9 KiB/s | 72.4 KiB | 00m00s [ 19/301] perl-File-Basename-0:2.86-517 100% | 241.8 KiB/s | 17.4 KiB | 00m00s [ 20/301] perl-File-Copy-0:2.41-517.fc4 100% | 282.6 KiB/s | 20.3 KiB | 00m00s [ 21/301] cmake-data-0:3.31.6-3.fc43.no 100% | 1.3 MiB/s | 2.5 MiB | 00m02s [ 22/301] perl-Getopt-Std-0:1.14-517.fc 100% | 221.1 KiB/s | 15.9 KiB | 00m00s [ 23/301] perl-File-Which-0:1.27-13.fc4 100% | 77.7 KiB/s | 21.6 KiB | 00m00s [ 24/301] perl-PathTools-0:3.91-513.fc4 100% | 1.2 MiB/s | 87.3 KiB | 00m00s [ 25/301] perl-Scalar-List-Utils-5:1.69 100% | 1.0 MiB/s | 74.8 KiB | 00m00s [ 26/301] perl-URI-0:5.32-1.fc43.noarch 100% | 1.9 MiB/s | 143.5 KiB | 00m00s [ 27/301] environment-modules-0:5.5.0-3 100% | 1.5 MiB/s | 764.7 KiB | 00m01s [ 28/301] emacs-filesystem-1:30.0-4.fc4 100% | 102.2 KiB/s | 7.4 KiB | 00m00s [ 29/301] vim-filesystem-2:9.1.1435-1.f 100% | 180.2 KiB/s | 15.3 KiB | 00m00s [ 30/301] perl-Archive-Tar-0:3.04-1.fc4 100% | 845.7 KiB/s | 71.0 KiB | 00m00s [ 31/301] perl-Attribute-Handlers-0:1.0 100% | 114.5 KiB/s | 28.3 KiB | 00m00s [ 32/301] perl-AutoLoader-0:5.74-517.fc 100% | 298.2 KiB/s | 21.5 KiB | 00m00s [ 33/301] perl-AutoSplit-0:5.74-517.fc4 100% | 303.6 KiB/s | 21.9 KiB | 00m00s [ 34/301] perl-B-0:1.89-517.fc43.x86_64 100% | 2.3 MiB/s | 177.0 KiB | 00m00s [ 35/301] cmake-0:3.31.6-3.fc43.x86_64 100% | 2.8 MiB/s | 12.2 MiB | 00m04s [ 36/301] perl-Benchmark-0:1.25-517.fc4 100% | 370.3 KiB/s | 27.0 KiB | 00m00s [ 37/301] perl-CPAN-0:2.38-4.fc43.noarc 100% | 4.9 MiB/s | 567.2 KiB | 00m00s [ 38/301] perl-CPAN-Meta-0:2.150010-512 100% | 2.1 MiB/s | 190.8 KiB | 00m00s [ 39/301] perl-CPAN-Meta-Requirements-0 100% | 399.6 KiB/s | 35.2 KiB | 00m00s [ 40/301] perl-CPAN-Meta-YAML-0:0.020-2 100% | 367.3 KiB/s | 26.8 KiB | 00m00s [ 41/301] perl-Carp-0:1.54-512.fc42.noa 100% | 400.9 KiB/s | 28.9 KiB | 00m00s [ 42/301] perl-Class-Struct-0:0.68-517. 100% | 309.8 KiB/s | 22.3 KiB | 00m00s [ 43/301] perl-Compress-Raw-Bzip2-0:2.2 100% | 448.7 KiB/s | 36.3 KiB | 00m00s [ 44/301] perl-Compress-Raw-Zlib-0:2.21 100% | 744.1 KiB/s | 65.5 KiB | 00m00s [ 45/301] perl-Config-Extensions-0:0.03 100% | 173.6 KiB/s | 12.5 KiB | 00m00s [ 46/301] perl-Config-Perl-V-0:0.38-2.f 100% | 244.5 KiB/s | 21.8 KiB | 00m00s [ 47/301] perl-DBM_Filter-0:0.06-517.fc 100% | 345.3 KiB/s | 27.3 KiB | 00m00s [ 48/301] perl-DB_File-0:1.859-513.fc42 100% | 1.0 MiB/s | 81.0 KiB | 00m00s [ 49/301] perl-Data-Dumper-0:2.189-513. 100% | 776.0 KiB/s | 56.7 KiB | 00m00s [ 50/301] perl-Devel-Peek-0:1.34-517.fc 100% | 440.3 KiB/s | 32.1 KiB | 00m00s [ 51/301] perl-Devel-PPPort-0:3.72-513. 100% | 1.3 MiB/s | 220.8 KiB | 00m00s [ 52/301] perl-Devel-SelfStubber-0:1.06 100% | 202.2 KiB/s | 14.6 KiB | 00m00s [ 53/301] perl-Digest-0:1.20-512.fc42.n 100% | 341.6 KiB/s | 24.9 KiB | 00m00s [ 54/301] perl-Digest-MD5-0:2.59-6.fc42 100% | 492.9 KiB/s | 36.0 KiB | 00m00s [ 55/301] perl-Digest-SHA-1:6.04-513.fc 100% | 715.0 KiB/s | 62.2 KiB | 00m00s [ 56/301] perl-DirHandle-0:1.05-517.fc4 100% | 176.3 KiB/s | 12.7 KiB | 00m00s [ 57/301] perl-Dumpvalue-0:2.27-517.fc4 100% | 254.2 KiB/s | 18.6 KiB | 00m00s [ 58/301] perl-DynaLoader-0:1.56-517.fc 100% | 365.1 KiB/s | 26.3 KiB | 00m00s [ 59/301] perl-English-0:1.11-517.fc43. 100% | 192.0 KiB/s | 13.8 KiB | 00m00s [ 60/301] perl-Env-0:1.06-512.fc42.noar 100% | 237.3 KiB/s | 19.7 KiB | 00m00s [ 61/301] perl-Errno-0:1.38-517.fc43.x8 100% | 208.0 KiB/s | 15.2 KiB | 00m00s [ 62/301] perl-Exporter-0:5.78-512.fc42 100% | 430.4 KiB/s | 31.0 KiB | 00m00s [ 63/301] rocm-hip-0:6.4.1-2.fc43.x86_6 100% | 3.9 MiB/s | 9.5 MiB | 00m02s [ 64/301] perl-ExtUtils-CBuilder-1:0.28 100% | 588.4 KiB/s | 50.6 KiB | 00m00s [ 65/301] perl-ExtUtils-Command-2:7.76- 100% | 162.5 KiB/s | 14.0 KiB | 00m00s [ 66/301] perl-ExtUtils-Constant-0:0.25 100% | 601.9 KiB/s | 43.9 KiB | 00m00s [ 67/301] perl-ExtUtils-Embed-0:1.35-51 100% | 248.8 KiB/s | 17.9 KiB | 00m00s [ 68/301] perl-ExtUtils-Install-0:2.22- 100% | 511.2 KiB/s | 43.5 KiB | 00m00s [ 69/301] perl-ExtUtils-MM-Utils-2:7.76 100% | 160.4 KiB/s | 11.5 KiB | 00m00s [ 70/301] perl-ExtUtils-Miniperl-0:1.14 100% | 211.9 KiB/s | 15.3 KiB | 00m00s [ 71/301] perl-ExtUtils-Manifest-1:1.75 100% | 392.4 KiB/s | 34.1 KiB | 00m00s [ 72/301] perl-ExtUtils-MakeMaker-2:7.7 100% | 2.0 MiB/s | 294.6 KiB | 00m00s [ 73/301] perl-Fcntl-0:1.18-517.fc43.x8 100% | 417.6 KiB/s | 30.1 KiB | 00m00s [ 74/301] perl-ExtUtils-ParseXS-1:3.57- 100% | 2.1 MiB/s | 207.6 KiB | 00m00s [ 75/301] perl-File-Compare-0:1.100.800 100% | 184.7 KiB/s | 13.5 KiB | 00m00s [ 76/301] perl-File-DosGlob-0:1.12-517. 100% | 244.6 KiB/s | 19.8 KiB | 00m00s [ 77/301] perl-File-Fetch-0:1.08-1.fc43 100% | 358.1 KiB/s | 30.8 KiB | 00m00s [ 78/301] perl-File-Find-0:1.44-517.fc4 100% | 354.9 KiB/s | 25.6 KiB | 00m00s [ 79/301] perl-File-Path-0:2.18-512.fc4 100% | 488.5 KiB/s | 35.2 KiB | 00m00s [ 80/301] perl-File-Temp-1:0.231.100-51 100% | 810.5 KiB/s | 59.2 KiB | 00m00s [ 81/301] perl-File-stat-0:1.14-517.fc4 100% | 236.9 KiB/s | 17.3 KiB | 00m00s [ 82/301] perl-FileCache-0:1.10-517.fc4 100% | 204.7 KiB/s | 14.9 KiB | 00m00s [ 83/301] perl-FileHandle-0:2.05-517.fc 100% | 215.5 KiB/s | 15.7 KiB | 00m00s [ 84/301] perl-Filter-2:1.64-513.fc42.x 100% | 1.0 MiB/s | 86.0 KiB | 00m00s [ 85/301] perl-Filter-Simple-0:0.96-512 100% | 318.0 KiB/s | 27.0 KiB | 00m00s [ 86/301] perl-FindBin-0:1.54-517.fc43. 100% | 198.1 KiB/s | 14.5 KiB | 00m00s [ 87/301] perl-GDBM_File-1:1.24-517.fc4 100% | 585.7 KiB/s | 42.8 KiB | 00m00s [ 88/301] perl-Getopt-Long-1:2.58-3.fc4 100% | 872.7 KiB/s | 63.7 KiB | 00m00s [ 89/301] perl-HTTP-Tiny-0:0.090-2.fc42 100% | 774.4 KiB/s | 56.5 KiB | 00m00s [ 90/301] perl-Hash-Util-0:0.32-517.fc4 100% | 476.4 KiB/s | 34.8 KiB | 00m00s [ 91/301] perl-Hash-Util-FieldHash-0:1. 100% | 533.8 KiB/s | 39.0 KiB | 00m00s [ 92/301] perl-I18N-Collate-0:1.02-517. 100% | 197.5 KiB/s | 14.4 KiB | 00m00s [ 93/301] perl-I18N-LangTags-0:0.45-517 100% | 721.9 KiB/s | 52.7 KiB | 00m00s [ 94/301] perl-I18N-Langinfo-0:0.24-517 100% | 354.2 KiB/s | 25.9 KiB | 00m00s [ 95/301] perl-IO-0:1.55-517.fc43.x86_6 100% | 1.1 MiB/s | 82.0 KiB | 00m00s [ 96/301] perl-IO-Socket-IP-0:0.43-2.fc 100% | 588.4 KiB/s | 42.4 KiB | 00m00s [ 97/301] perl-IO-Compress-0:2.213-3.fc 100% | 2.0 MiB/s | 305.7 KiB | 00m00s [ 98/301] perl-IO-Zlib-1:1.15-512.fc42. 100% | 223.7 KiB/s | 19.7 KiB | 00m00s [ 99/301] perl-IPC-Cmd-2:1.04-513.fc42. 100% | 544.0 KiB/s | 39.7 KiB | 00m00s [100/301] perl-IPC-Open3-0:1.22-517.fc4 100% | 306.8 KiB/s | 22.1 KiB | 00m00s [101/301] perl-IPC-SysV-0:2.09-513.fc42 100% | 510.3 KiB/s | 40.8 KiB | 00m00s [102/301] perl-JSON-PP-1:4.16-513.fc42. 100% | 897.8 KiB/s | 65.5 KiB | 00m00s [103/301] perl-Locale-Maketext-0:1.33-5 100% | 1.2 MiB/s | 93.7 KiB | 00m00s [104/301] perl-Locale-Maketext-Simple-1 100% | 244.1 KiB/s | 17.8 KiB | 00m00s [105/301] perl-MIME-Base64-0:3.16-512.f 100% | 414.6 KiB/s | 29.9 KiB | 00m00s [106/301] perl-Math-BigInt-1:2.0050.03- 100% | 3.1 MiB/s | 234.6 KiB | 00m00s [107/301] perl-Math-BigInt-FastCalc-0:0 100% | 336.0 KiB/s | 28.2 KiB | 00m00s [108/301] perl-Math-Complex-0:1.62-517. 100% | 643.1 KiB/s | 46.3 KiB | 00m00s [109/301] perl-Memoize-0:1.16-517.fc43. 100% | 638.1 KiB/s | 46.6 KiB | 00m00s [110/301] perl-Module-CoreList-1:5.2025 100% | 1.2 MiB/s | 92.6 KiB | 00m00s [111/301] perl-Module-CoreList-tools-1: 100% | 227.2 KiB/s | 18.9 KiB | 00m00s [112/301] perl-Module-Load-1:0.36-512.f 100% | 240.1 KiB/s | 17.3 KiB | 00m00s [113/301] perl-Module-Load-Conditional- 100% | 301.9 KiB/s | 22.0 KiB | 00m00s [114/301] perl-Module-Metadata-0:1.0000 100% | 484.6 KiB/s | 35.4 KiB | 00m00s [115/301] perl-Module-Loaded-1:0.08-517 100% | 156.7 KiB/s | 13.6 KiB | 00m00s [116/301] perl-NDBM_File-0:1.17-517.fc4 100% | 313.2 KiB/s | 22.9 KiB | 00m00s [117/301] perl-NEXT-0:0.69-517.fc43.noa 100% | 261.4 KiB/s | 21.2 KiB | 00m00s [118/301] perl-Net-0:1.04-517.fc43.noar 100% | 299.2 KiB/s | 22.7 KiB | 00m00s [119/301] perl-Net-Ping-0:2.76-512.fc42 100% | 563.7 KiB/s | 49.6 KiB | 00m00s [120/301] perl-ODBM_File-0:1.18-517.fc4 100% | 313.9 KiB/s | 22.9 KiB | 00m00s [121/301] perl-Opcode-0:1.65-517.fc43.x 100% | 493.2 KiB/s | 36.0 KiB | 00m00s [122/301] perl-POSIX-0:2.20-517.fc43.x8 100% | 1.3 MiB/s | 97.8 KiB | 00m00s [123/301] perl-Params-Check-1:0.38-512. 100% | 298.8 KiB/s | 21.8 KiB | 00m00s [124/301] perl-Perl-OSType-0:1.010-513. 100% | 289.1 KiB/s | 22.8 KiB | 00m00s [125/301] perl-PerlIO-via-QuotedPrint-0 100% | 252.2 KiB/s | 21.7 KiB | 00m00s [126/301] perl-Pod-Checker-4:1.77-512.f 100% | 412.7 KiB/s | 31.8 KiB | 00m00s [127/301] perl-Pod-Escapes-1:1.07-512.f 100% | 275.2 KiB/s | 19.8 KiB | 00m00s [128/301] perl-Pod-Functions-0:1.14-517 100% | 179.3 KiB/s | 14.9 KiB | 00m00s [129/301] perl-Pod-Html-0:1.35-517.fc43 100% | 412.0 KiB/s | 29.7 KiB | 00m00s [130/301] perl-Pod-Perldoc-0:3.28.01-51 100% | 1.1 MiB/s | 85.8 KiB | 00m00s [131/301] perl-Pod-Simple-1:3.47-1.fc43 100% | 2.9 MiB/s | 219.9 KiB | 00m00s [132/301] perl-Pod-Usage-4:2.05-1.fc43. 100% | 555.7 KiB/s | 40.6 KiB | 00m00s [133/301] perl-Safe-0:2.46-517.fc43.noa 100% | 344.4 KiB/s | 25.1 KiB | 00m00s [134/301] perl-Search-Dict-0:1.07-517.f 100% | 181.6 KiB/s | 13.3 KiB | 00m00s [135/301] perl-SelectSaver-0:1.02-517.f 100% | 166.1 KiB/s | 12.0 KiB | 00m00s [136/301] perl-SelfLoader-0:1.27-517.fc 100% | 298.7 KiB/s | 21.8 KiB | 00m00s [137/301] perl-Socket-4:2.038-512.fc42. 100% | 750.1 KiB/s | 54.8 KiB | 00m00s [138/301] perl-Storable-1:3.32-512.fc42 100% | 1.3 MiB/s | 99.6 KiB | 00m00s [139/301] perl-Symbol-0:1.09-517.fc43.n 100% | 197.8 KiB/s | 14.4 KiB | 00m00s [140/301] perl-Sys-Hostname-0:1.25-517. 100% | 238.0 KiB/s | 17.4 KiB | 00m00s [141/301] perl-Term-ANSIColor-0:5.01-51 100% | 653.3 KiB/s | 47.7 KiB | 00m00s [142/301] perl-Sys-Syslog-0:0.36-513.fc 100% | 541.8 KiB/s | 46.6 KiB | 00m00s [143/301] perl-Term-Cap-0:1.18-512.fc42 100% | 303.5 KiB/s | 22.2 KiB | 00m00s [144/301] perl-Term-Complete-0:1.403-51 100% | 181.3 KiB/s | 13.2 KiB | 00m00s [145/301] perl-Term-ReadLine-0:1.17-517 100% | 267.7 KiB/s | 19.3 KiB | 00m00s [146/301] perl-Term-Table-0:0.024-2.fc4 100% | 501.0 KiB/s | 43.1 KiB | 00m00s [147/301] perl-Test-0:1.31-517.fc43.noa 100% | 394.3 KiB/s | 28.8 KiB | 00m00s [148/301] perl-Test-Harness-1:3.52-1.fc 100% | 3.2 MiB/s | 277.3 KiB | 00m00s [149/301] perl-Text-Abbrev-0:1.02-517.f 100% | 137.7 KiB/s | 12.4 KiB | 00m00s [150/301] perl-Text-Balanced-0:2.06-512 100% | 574.1 KiB/s | 48.8 KiB | 00m00s [151/301] perl-Test-Simple-3:1.302214-1 100% | 5.2 MiB/s | 863.2 KiB | 00m00s [152/301] perl-Text-ParseWords-0:3.31-5 100% | 228.9 KiB/s | 16.5 KiB | 00m00s [153/301] perl-Text-Tabs+Wrap-0:2024.00 100% | 302.8 KiB/s | 21.8 KiB | 00m00s [154/301] perl-Thread-0:3.05-517.fc43.n 100% | 249.9 KiB/s | 18.2 KiB | 00m00s [155/301] perl-Thread-Queue-0:3.14-512. 100% | 293.2 KiB/s | 21.4 KiB | 00m00s [156/301] perl-Thread-Semaphore-0:2.13- 100% | 217.5 KiB/s | 15.9 KiB | 00m00s [157/301] perl-Tie-0:4.6-517.fc43.noarc 100% | 382.5 KiB/s | 27.9 KiB | 00m00s [158/301] perl-Tie-File-0:1.09-517.fc43 100% | 596.7 KiB/s | 43.6 KiB | 00m00s [159/301] perl-Tie-Memoize-0:1.1-517.fc 100% | 199.3 KiB/s | 14.3 KiB | 00m00s [160/301] perl-Tie-RefHash-0:1.41-2.fc4 100% | 284.2 KiB/s | 23.6 KiB | 00m00s [161/301] perl-Time-0:1.04-517.fc43.noa 100% | 232.5 KiB/s | 17.0 KiB | 00m00s [162/301] perl-Time-HiRes-4:1.9777-512. 100% | 668.5 KiB/s | 57.5 KiB | 00m00s [163/301] perl-Time-Local-2:1.350-512.f 100% | 472.1 KiB/s | 34.5 KiB | 00m00s [164/301] perl-Time-Piece-0:1.3401-517. 100% | 554.0 KiB/s | 40.4 KiB | 00m00s [165/301] perl-Unicode-Collate-0:1.31-5 100% | 5.7 MiB/s | 645.6 KiB | 00m00s [166/301] perl-Unicode-Normalize-0:1.32 100% | 1.0 MiB/s | 74.1 KiB | 00m00s [167/301] perl-Unicode-UCD-0:0.78-517.f 100% | 1.0 MiB/s | 78.5 KiB | 00m00s [168/301] perl-User-pwent-0:1.05-517.fc 100% | 270.5 KiB/s | 19.7 KiB | 00m00s [169/301] perl-autouse-0:1.11-517.fc43. 100% | 192.2 KiB/s | 14.0 KiB | 00m00s [170/301] perl-autodie-0:2.37-513.fc42. 100% | 1.1 MiB/s | 96.9 KiB | 00m00s [171/301] perl-base-0:2.27-517.fc43.noa 100% | 228.4 KiB/s | 16.4 KiB | 00m00s [172/301] perl-blib-0:1.07-517.fc43.noa 100% | 173.1 KiB/s | 12.6 KiB | 00m00s [173/301] perl-bignum-0:0.67-513.fc42.n 100% | 604.5 KiB/s | 49.0 KiB | 00m00s [174/301] perl-constant-0:1.33-513.fc42 100% | 319.2 KiB/s | 23.0 KiB | 00m00s [175/301] perl-debugger-0:1.60-517.fc43 100% | 1.5 MiB/s | 133.3 KiB | 00m00s [176/301] perl-deprecate-0:0.04-517.fc4 100% | 170.2 KiB/s | 14.8 KiB | 00m00s [177/301] perl-diagnostics-0:1.40-517.f 100% | 2.1 MiB/s | 217.8 KiB | 00m00s [178/301] perl-devel-4:5.40.2-517.fc43. 100% | 4.8 MiB/s | 764.3 KiB | 00m00s [179/301] perl-encoding-4:3.00-512.fc42 100% | 777.8 KiB/s | 63.0 KiB | 00m00s [180/301] perl-encoding-warnings-0:0.14 100% | 215.4 KiB/s | 16.8 KiB | 00m00s [181/301] perl-experimental-0:0.035-1.f 100% | 360.4 KiB/s | 26.7 KiB | 00m00s [182/301] perl-fields-0:2.27-517.fc43.n 100% | 215.4 KiB/s | 16.4 KiB | 00m00s [183/301] perl-filetest-0:1.03-517.fc43 100% | 195.1 KiB/s | 14.8 KiB | 00m00s [184/301] perl-if-0:0.61.000-517.fc43.n 100% | 197.7 KiB/s | 14.2 KiB | 00m00s [185/301] perl-less-0:0.03-517.fc43.noa 100% | 167.8 KiB/s | 13.4 KiB | 00m00s [186/301] perl-lib-0:0.65-517.fc43.x86_ 100% | 210.9 KiB/s | 15.2 KiB | 00m00s [187/301] perl-libnet-0:3.15-513.fc42.n 100% | 1.7 MiB/s | 128.4 KiB | 00m00s [188/301] perl-libnetcfg-4:5.40.2-517.f 100% | 202.0 KiB/s | 16.6 KiB | 00m00s [189/301] perl-locale-0:1.12-517.fc43.n 100% | 192.4 KiB/s | 13.9 KiB | 00m00s [190/301] perl-macros-4:5.40.2-517.fc43 100% | 165.3 KiB/s | 12.6 KiB | 00m00s [191/301] perl-meta-notation-0:5.40.2-5 100% | 127.1 KiB/s | 10.9 KiB | 00m00s [192/301] perl-libs-4:5.40.2-517.fc43.x 100% | 7.9 MiB/s | 2.3 MiB | 00m00s [193/301] perl-mro-0:1.29-517.fc43.x86_ 100% | 418.5 KiB/s | 30.1 KiB | 00m00s [194/301] perl-open-0:1.13-517.fc43.noa 100% | 229.6 KiB/s | 16.8 KiB | 00m00s [195/301] perl-overload-0:1.37-517.fc43 100% | 635.9 KiB/s | 45.8 KiB | 00m00s [196/301] perl-overloading-0:0.02-517.f 100% | 182.4 KiB/s | 13.1 KiB | 00m00s [197/301] perl-parent-1:0.244-2.fc42.no 100% | 211.5 KiB/s | 15.2 KiB | 00m00s [198/301] perl-perlfaq-0:5.20240218-512 100% | 3.6 MiB/s | 378.4 KiB | 00m00s [199/301] perl-ph-0:5.40.2-517.fc43.x86 100% | 671.3 KiB/s | 49.0 KiB | 00m00s [200/301] perl-podlators-1:6.0.2-3.fc42 100% | 1.7 MiB/s | 128.6 KiB | 00m00s [201/301] perl-sigtrap-0:1.10-517.fc43. 100% | 220.7 KiB/s | 15.9 KiB | 00m00s [202/301] perl-doc-0:5.40.2-517.fc43.no 100% | 4.4 MiB/s | 4.9 MiB | 00m01s [203/301] perl-subs-0:1.04-517.fc43.noa 100% | 166.0 KiB/s | 12.0 KiB | 00m00s [204/301] perl-sort-0:2.05-517.fc43.noa 100% | 167.9 KiB/s | 13.4 KiB | 00m00s [205/301] perl-threads-1:2.40-512.fc42. 100% | 794.9 KiB/s | 58.0 KiB | 00m00s [206/301] perl-threads-shared-0:1.69-51 100% | 609.4 KiB/s | 44.5 KiB | 00m00s [207/301] perl-vars-0:1.05-517.fc43.noa 100% | 181.4 KiB/s | 13.2 KiB | 00m00s [208/301] perl-utils-0:5.40.2-517.fc43. 100% | 625.5 KiB/s | 52.5 KiB | 00m00s [209/301] perl-version-9:0.99.33-2.fc42 100% | 758.5 KiB/s | 63.0 KiB | 00m00s [210/301] perl-vmsish-0:1.04-517.fc43.n 100% | 174.6 KiB/s | 14.3 KiB | 00m00s [211/301] perl-MIME-Base32-0:1.303-23.f 100% | 284.9 KiB/s | 20.5 KiB | 00m00s [212/301] numactl-libs-0:2.0.19-2.fc42. 100% | 434.4 KiB/s | 31.3 KiB | 00m00s [213/301] less-0:678-1.fc43.x86_64 100% | 2.6 MiB/s | 195.1 KiB | 00m00s [214/301] perl-IO-Compress-Lzma-0:2.213 100% | 892.2 KiB/s | 76.7 KiB | 00m00s [215/301] perl-Text-Diff-0:1.45-23.fc42 100% | 471.7 KiB/s | 40.1 KiB | 00m00s [216/301] man-db-0:2.13.1-1.fc43.x86_64 100% | 7.2 MiB/s | 1.4 MiB | 00m00s [217/301] perl-Archive-Zip-0:1.68-16.fc 100% | 1.3 MiB/s | 111.5 KiB | 00m00s [218/301] perl-Compress-Bzip2-0:2.28-21 100% | 762.3 KiB/s | 67.1 KiB | 00m00s [219/301] perl-Devel-Size-0:0.85-1.fc43 100% | 356.6 KiB/s | 30.7 KiB | 00m00s [220/301] perl-File-HomeDir-0:1.006-14. 100% | 689.5 KiB/s | 59.3 KiB | 00m00s [221/301] perl-Module-Build-2:0.42.34-8 100% | 2.7 MiB/s | 251.5 KiB | 00m00s [222/301] perl-Module-Signature-0:0.90- 100% | 973.1 KiB/s | 86.6 KiB | 00m00s [223/301] perl-Text-Glob-0:0.11-25.fc42 100% | 161.0 KiB/s | 13.4 KiB | 00m00s [224/301] perl-local-lib-0:2.000029-9.f 100% | 762.6 KiB/s | 66.3 KiB | 00m00s [225/301] libdb-0:5.3.28-65.fc43.x86_64 100% | 6.8 MiB/s | 770.8 KiB | 00m00s [226/301] perl-IO-Socket-SSL-0:2.091-1. 100% | 3.0 MiB/s | 230.3 KiB | 00m00s [227/301] perl-Net-SSLeay-0:1.94-9.fc43 100% | 4.8 MiB/s | 375.5 KiB | 00m00s [228/301] ncurses-0:6.5-5.20250125.fc42 100% | 5.3 MiB/s | 424.5 KiB | 00m00s [229/301] perl-IPC-System-Simple-0:1.30 100% | 531.0 KiB/s | 38.8 KiB | 00m00s [230/301] groff-base-0:1.23.0-8.fc42.x8 100% | 7.5 MiB/s | 1.1 MiB | 00m00s [231/301] libxcrypt-devel-0:4.4.38-7.fc 100% | 408.0 KiB/s | 29.4 KiB | 00m00s [232/301] systemtap-sdt-dtrace-0:5.3-2. 100% | 807.5 KiB/s | 69.4 KiB | 00m00s [233/301] libpipeline-0:1.5.8-2.fc42.x8 100% | 759.8 KiB/s | 60.0 KiB | 00m00s [234/301] perl-Compress-Raw-Lzma-0:2.21 100% | 712.8 KiB/s | 52.0 KiB | 00m00s [235/301] perl-Algorithm-Diff-0:1.2010- 100% | 515.3 KiB/s | 46.4 KiB | 00m00s [236/301] perl-Software-License-0:0.104 100% | 1.7 MiB/s | 148.0 KiB | 00m00s [237/301] perl-inc-latest-2:0.500-30.fc 100% | 274.3 KiB/s | 23.3 KiB | 00m00s [238/301] perl-Data-Section-0:0.200008- 100% | 346.2 KiB/s | 24.9 KiB | 00m00s [239/301] python3-pyparsing-0:3.1.2-10. 100% | 3.5 MiB/s | 286.9 KiB | 00m00s [240/301] glibc-devel-0:2.41.9000-15.fc 100% | 3.7 MiB/s | 553.4 KiB | 00m00s [241/301] perl-MRO-Compat-0:0.15-11.fc4 100% | 348.6 KiB/s | 25.4 KiB | 00m00s [242/301] perl-Text-Template-0:1.61-7.f 100% | 671.8 KiB/s | 59.1 KiB | 00m00s [243/301] perl-Sub-Exporter-0:0.991-5.f 100% | 1.0 MiB/s | 77.5 KiB | 00m00s [244/301] perl-Data-OptList-0:0.114-6.f 100% | 367.2 KiB/s | 26.8 KiB | 00m00s [245/301] perl-Package-Generator-0:1.10 100% | 311.3 KiB/s | 22.4 KiB | 00m00s [246/301] perl-Params-Util-0:1.102-17.f 100% | 393.9 KiB/s | 32.7 KiB | 00m00s [247/301] perl-Sub-Install-0:0.929-7.fc 100% | 310.1 KiB/s | 22.6 KiB | 00m00s [248/301] libdrm-devel-0:2.4.124-2.fc42 100% | 2.0 MiB/s | 179.7 KiB | 00m00s [249/301] libdrm-0:2.4.124-2.fc42.x86_6 100% | 2.1 MiB/s | 161.0 KiB | 00m00s [250/301] rocm-smi-0:6.4.1-1.fc43.x86_6 100% | 21.0 MiB/s | 603.1 KiB | 00m00s [251/301] libpciaccess-0:0.16-15.fc42.x 100% | 365.1 KiB/s | 26.3 KiB | 00m00s [252/301] python3-0:3.14.0~b2-3.fc43.x8 100% | 365.0 KiB/s | 26.6 KiB | 00m00s [253/301] mpdecimal-0:4.0.1-1.fc43.x86_ 100% | 1.3 MiB/s | 97.1 KiB | 00m00s [254/301] hwdata-0:0.396-1.fc43.noarch 100% | 7.5 MiB/s | 1.6 MiB | 00m00s [255/301] tzdata-0:2025b-1.fc43.noarch 100% | 8.8 MiB/s | 714.0 KiB | 00m00s [256/301] rocm-runtime-0:6.4.1-1.fc43.x 100% | 22.6 MiB/s | 649.3 KiB | 00m00s [257/301] rocm-core-0:6.4.1-1.fc43.x86_ 100% | 751.5 KiB/s | 13.5 KiB | 00m00s [258/301] python-pip-wheel-0:25.1.1-4.f 100% | 5.5 MiB/s | 1.2 MiB | 00m00s [259/301] rocm-device-libs-0:19-10.rocm 100% | 1.6 MiB/s | 490.3 KiB | 00m00s [260/301] python3-libs-0:3.14.0~b2-3.fc 100% | 13.0 MiB/s | 9.8 MiB | 00m01s [261/301] rocm-comgr-0:19-10.rocm6.4.1. 100% | 62.3 MiB/s | 30.5 MiB | 00m00s [262/301] rocm-clang-libs-0:19-10.rocm6 100% | 32.2 MiB/s | 22.8 MiB | 00m01s [263/301] rocm-llvm-libs-0:19-10.rocm6. 100% | 24.3 MiB/s | 20.2 MiB | 00m01s [264/301] libstdc++-devel-0:15.1.1-2.fc 100% | 7.8 MiB/s | 2.7 MiB | 00m00s [265/301] gcc-0:15.1.1-2.fc43.x86_64 100% | 35.2 MiB/s | 39.4 MiB | 00m01s [266/301] cpp-0:15.1.1-2.fc43.x86_64 100% | 27.8 MiB/s | 12.9 MiB | 00m00s [267/301] hipcc-0:19-10.rocm6.4.1.fc43. 100% | 420.7 KiB/s | 133.8 KiB | 00m00s [268/301] systemtap-sdt-devel-0:5.3-2.f 100% | 473.8 KiB/s | 68.7 KiB | 00m00s [269/301] perl-Encode-devel-4:3.21-512. 100% | 255.1 KiB/s | 41.1 KiB | 00m00s [270/301] libpciaccess-devel-0:0.16-15. 100% | 142.9 KiB/s | 12.4 KiB | 00m00s [271/301] perl-Encode-4:3.21-512.fc42.x 100% | 2.1 MiB/s | 1.1 MiB | 00m01s [272/301] procps-ng-0:4.0.4-6.fc42.x86_ 100% | 1.6 MiB/s | 365.3 KiB | 00m00s [273/301] kernel-headers-0:6.16.0-0.rc1 100% | 4.6 MiB/s | 1.7 MiB | 00m00s [274/301] rocm-libc++-0:19-10.rocm6.4.1 100% | 14.7 MiB/s | 345.8 KiB | 00m00s [275/301] rocm-llvm-filesystem-0:19-10. 100% | 1.9 MiB/s | 22.7 KiB | 00m00s [276/301] libtommath-0:1.3.1~rc1-5.fc42 100% | 748.5 KiB/s | 64.4 KiB | 00m00s [277/301] rocm-clang-devel-0:19-10.rocm 100% | 53.1 MiB/s | 2.4 MiB | 00m00s [278/301] tcl-1:9.0.0-8.fc43.x86_64 100% | 5.4 MiB/s | 1.2 MiB | 00m00s [279/301] rocm-lld-0:19-10.rocm6.4.1.fc 100% | 8.8 MiB/s | 1.5 MiB | 00m00s [280/301] git-0:2.49.0-2.fc43.x86_64 100% | 571.1 KiB/s | 51.4 KiB | 00m00s [281/301] rocm-clang-0:19-10.rocm6.4.1. 100% | 42.2 MiB/s | 16.0 MiB | 00m00s [282/301] rocm-llvm-static-0:19-10.rocm 100% | 56.5 MiB/s | 29.3 MiB | 00m01s [283/301] perl-Git-0:2.49.0-2.fc43.noar 100% | 521.9 KiB/s | 38.1 KiB | 00m00s [284/301] perl-TermReadKey-0:2.38-24.fc 100% | 421.6 KiB/s | 35.4 KiB | 00m00s [285/301] git-core-doc-0:2.49.0-2.fc43. 100% | 10.1 MiB/s | 3.0 MiB | 00m00s [286/301] perl-Error-1:0.17030-1.fc43.n 100% | 561.3 KiB/s | 40.4 KiB | 00m00s [287/301] libedit-0:3.1-55.20250104cvs. 100% | 1.4 MiB/s | 105.3 KiB | 00m00s [288/301] libfido2-0:1.15.0-3.fc42.x86_ 100% | 1.3 MiB/s | 98.4 KiB | 00m00s [289/301] openssh-clients-0:10.0p1-3.fc 100% | 2.5 MiB/s | 746.7 KiB | 00m00s [290/301] openssh-0:10.0p1-3.fc43.x86_6 100% | 4.4 MiB/s | 339.5 KiB | 00m00s [291/301] rocm-clang-runtime-devel-0:19 100% | 18.5 MiB/s | 492.8 KiB | 00m00s [292/301] libcbor-0:0.11.0-3.fc42.x86_6 100% | 455.9 KiB/s | 33.3 KiB | 00m00s [293/301] rocm-libc++-devel-0:19-10.roc 100% | 22.1 MiB/s | 904.2 KiB | 00m00s [294/301] rocm-llvm-devel-0:19-10.rocm6 100% | 60.7 MiB/s | 3.8 MiB | 00m00s [295/301] zlib-ng-compat-devel-0:2.2.4- 100% | 525.1 KiB/s | 38.3 KiB | 00m00s [296/301] git-core-0:2.49.0-2.fc43.x86_ 100% | 4.7 MiB/s | 4.9 MiB | 00m01s [297/301] rocm-llvm-0:19-10.rocm6.4.1.f 100% | 73.4 MiB/s | 13.1 MiB | 00m00s [298/301] annobin-docs-0:12.96-1.fc43.n 100% | 1.2 MiB/s | 90.6 KiB | 00m00s [299/301] gcc-plugin-annobin-0:15.1.1-2 100% | 4.6 MiB/s | 52.3 KiB | 00m00s [300/301] cmake-rpm-macros-0:3.31.6-3.f 100% | 215.8 KiB/s | 15.8 KiB | 00m00s [301/301] annobin-plugin-gcc-0:12.96-1. 100% | 1.4 MiB/s | 982.2 KiB | 00m01s -------------------------------------------------------------------------------- [301/301] Total 100% | 19.1 MiB/s | 294.8 MiB | 00m15s Running transaction [ 1/303] Verify package files 100% | 502.0 B/s | 301.0 B | 00m01s [ 2/303] Prepare transaction 100% | 1.4 KiB/s | 301.0 B | 00m00s [ 3/303] Installing cmake-filesystem-0 100% | 1.9 MiB/s | 7.6 KiB | 00m00s [ 4/303] Installing less-0:678-1.fc43. 100% | 23.5 MiB/s | 409.1 KiB | 00m00s [ 5/303] Installing libmpc-0:1.3.1-7.f 100% | 81.1 MiB/s | 166.1 KiB | 00m00s [ 6/303] Installing make-1:4.4.1-10.fc 100% | 81.8 MiB/s | 1.8 MiB | 00m00s [ 7/303] Installing expat-0:2.7.1-1.fc 100% | 18.1 MiB/s | 296.3 KiB | 00m00s [ 8/303] Installing rocm-llvm-filesyst 100% | 2.4 MiB/s | 15.0 KiB | 00m00s [ 9/303] Installing rocm-libc++-0:19-1 100% | 56.0 MiB/s | 1.2 MiB | 00m00s [ 10/303] Installing rocm-llvm-libs-0:1 100% | 68.2 MiB/s | 84.7 MiB | 00m01s [ 11/303] Installing rocm-clang-libs-0: 100% | 70.6 MiB/s | 98.4 MiB | 00m01s [ 12/303] Installing kernel-headers-0:6 100% | 110.1 MiB/s | 6.8 MiB | 00m00s [ 13/303] Installing libxcrypt-devel-0: 100% | 16.2 MiB/s | 33.1 KiB | 00m00s [ 14/303] Installing glibc-devel-0:2.41 100% | 86.7 MiB/s | 2.3 MiB | 00m00s [ 15/303] Installing rocm-comgr-0:19-10 100% | 66.7 MiB/s | 123.9 MiB | 00m02s [ 16/303] Installing groff-base-0:1.23. 100% | 76.3 MiB/s | 3.9 MiB | 00m00s [ 17/303] Installing numactl-libs-0:2.0 100% | 52.5 MiB/s | 53.8 KiB | 00m00s [ 18/303] Installing vim-filesystem-2:9 100% | 2.3 MiB/s | 4.7 KiB | 00m00s [ 19/303] Installing rocm-lld-0:19-10.r 100% | 61.0 MiB/s | 5.7 MiB | 00m00s [ 20/303] Installing rocm-libc++-devel- 100% | 63.8 MiB/s | 7.7 MiB | 00m00s [ 21/303] Installing cpp-0:15.1.1-2.fc4 100% | 276.4 MiB/s | 37.9 MiB | 00m00s [ 22/303] Installing gcc-0:15.1.1-2.fc4 100% | 302.9 MiB/s | 111.2 MiB | 00m00s [ 23/303] Installing zlib-ng-compat-dev 100% | 106.0 MiB/s | 108.5 KiB | 00m00s [ 24/303] Installing annobin-docs-0:12. 100% | 48.8 MiB/s | 100.0 KiB | 00m00s [ 25/303] Installing rocm-clang-runtime 100% | 105.3 MiB/s | 6.9 MiB | 00m00s [ 26/303] Installing libcbor-0:0.11.0-3 100% | 77.3 MiB/s | 79.2 KiB | 00m00s [ 27/303] Installing libfido2-0:1.15.0- 100% | 119.0 MiB/s | 243.6 KiB | 00m00s [ 28/303] Installing openssh-0:10.0p1-3 100% | 73.3 MiB/s | 1.4 MiB | 00m00s [ 29/303] Installing libedit-0:3.1-55.2 100% | 120.0 MiB/s | 245.8 KiB | 00m00s [ 30/303] Installing openssh-clients-0: 100% | 72.5 MiB/s | 2.6 MiB | 00m00s [ 31/303] Installing git-core-0:2.49.0- 100% | 256.2 MiB/s | 22.8 MiB | 00m00s [ 32/303] Installing git-core-doc-0:2.4 100% | 191.2 MiB/s | 17.8 MiB | 00m00s [ 33/303] Installing libtommath-0:1.3.1 100% | 64.2 MiB/s | 131.5 KiB | 00m00s [ 34/303] Installing tcl-1:9.0.0-8.fc43 100% | 117.1 MiB/s | 4.3 MiB | 00m00s [ 35/303] Installing procps-ng-0:4.0.4- 100% | 48.1 MiB/s | 1.0 MiB | 00m00s [ 36/303] Installing systemtap-sdt-deve 100% | 60.0 MiB/s | 184.3 KiB | 00m00s [ 37/303] Installing libstdc++-devel-0: 100% | 216.2 MiB/s | 16.2 MiB | 00m00s [ 38/303] Installing gcc-c++-0:15.1.1-2 100% | 282.9 MiB/s | 41.3 MiB | 00m00s [ 39/303] Installing rocm-core-0:6.4.1- 100% | 2.6 MiB/s | 13.5 KiB | 00m00s [ 40/303] Installing tzdata-0:2025b-1.f 100% | 24.6 MiB/s | 1.9 MiB | 00m00s [ 41/303] Installing python-pip-wheel-0 100% | 415.0 MiB/s | 1.2 MiB | 00m00s [ 42/303] Installing mpdecimal-0:4.0.1- 100% | 30.5 MiB/s | 218.8 KiB | 00m00s [ 43/303] Installing python3-libs-0:3.1 100% | 205.0 MiB/s | 43.0 MiB | 00m00s [ 44/303] Installing python3-0:3.14.0~b 100% | 2.0 MiB/s | 30.6 KiB | 00m00s [ 45/303] Installing cmake-rpm-macros-0 100% | 8.1 MiB/s | 8.3 KiB | 00m00s [ 46/303] Installing python3-pyparsing- 100% | 171.6 MiB/s | 1.0 MiB | 00m00s [ 47/303] Installing systemtap-sdt-dtra 100% | 11.8 MiB/s | 180.9 KiB | 00m00s [ 48/303] Installing rocm-smi-0:6.4.1-1 100% | 120.7 MiB/s | 2.7 MiB | 00m00s [ 49/303] Installing hwdata-0:0.396-1.f 100% | 397.0 MiB/s | 9.5 MiB | 00m00s [ 50/303] Installing libpciaccess-0:0.1 100% | 44.8 MiB/s | 45.9 KiB | 00m00s [ 51/303] Installing libdrm-0:2.4.124-2 100% | 134.1 MiB/s | 411.8 KiB | 00m00s [ 52/303] Installing rocm-runtime-0:6.4 100% | 341.7 MiB/s | 3.1 MiB | 00m00s [ 53/303] Installing rocm-runtime-devel 100% | 187.1 MiB/s | 574.9 KiB | 00m00s [ 54/303] Installing rocm-llvm-0:19-10. 100% | 61.4 MiB/s | 48.5 MiB | 00m01s [ 55/303] Installing rocm-llvm-devel-0: 100% | 71.4 MiB/s | 25.7 MiB | 00m00s [ 56/303] Installing rocm-llvm-static-0 100% | 91.4 MiB/s | 250.2 MiB | 00m03s [ 57/303] Installing libpciaccess-devel 100% | 15.5 MiB/s | 15.9 KiB | 00m00s [ 58/303] Installing libdrm-devel-0:2.4 100% | 140.1 MiB/s | 717.5 KiB | 00m00s [ 59/303] Installing libpipeline-0:1.5. 100% | 6.5 MiB/s | 146.6 KiB | 00m00s [ 60/303] Installing man-db-0:2.13.1-1. 100% | 47.8 MiB/s | 2.9 MiB | 00m00s [ 61/303] Installing environment-module 100% | 39.2 MiB/s | 1.8 MiB | 00m00s [ 62/303] Installing ncurses-0:6.5-5.20 100% | 27.3 MiB/s | 614.7 KiB | 00m00s [ 63/303] Installing perl-Digest-0:1.20 100% | 18.1 MiB/s | 37.1 KiB | 00m00s [ 64/303] Installing perl-FileHandle-0: 100% | 9.5 MiB/s | 9.8 KiB | 00m00s [ 65/303] Installing perl-B-0:1.89-517. 100% | 122.4 MiB/s | 501.3 KiB | 00m00s [ 66/303] Installing perl-Digest-MD5-0: 100% | 20.0 MiB/s | 61.6 KiB | 00m00s [ 67/303] Installing perl-MIME-Base32-0 100% | 15.7 MiB/s | 32.2 KiB | 00m00s [ 68/303] Installing perl-libnet-0:3.15 100% | 71.9 MiB/s | 294.7 KiB | 00m00s [ 69/303] Installing perl-Data-Dumper-0 100% | 57.4 MiB/s | 117.5 KiB | 00m00s [ 70/303] Installing perl-URI-0:5.32-1. 100% | 44.6 MiB/s | 274.1 KiB | 00m00s [ 71/303] Installing perl-IO-Socket-IP- 100% | 49.9 MiB/s | 102.2 KiB | 00m00s [ 72/303] Installing perl-AutoLoader-0: 100% | 20.5 MiB/s | 20.9 KiB | 00m00s [ 73/303] Installing perl-Net-SSLeay-0: 100% | 123.5 MiB/s | 1.4 MiB | 00m00s [ 74/303] Installing perl-IO-Socket-SSL 100% | 139.8 MiB/s | 715.5 KiB | 00m00s [ 75/303] Installing perl-Pod-Escapes-1 100% | 25.3 MiB/s | 25.9 KiB | 00m00s [ 76/303] Installing perl-File-Path-0:2 100% | 63.0 MiB/s | 64.5 KiB | 00m00s [ 77/303] Installing perl-Time-Local-2: 100% | 34.5 MiB/s | 70.6 KiB | 00m00s [ 78/303] Installing perl-locale-0:1.12 100% | 6.7 MiB/s | 6.9 KiB | 00m00s [ 79/303] Installing perl-if-0:0.61.000 100% | 6.1 MiB/s | 6.2 KiB | 00m00s [ 80/303] Installing perl-Text-Tabs+Wra 100% | 11.7 MiB/s | 23.9 KiB | 00m00s [ 81/303] Installing perl-Pod-Simple-1: 100% | 112.3 MiB/s | 574.8 KiB | 00m00s [ 82/303] Installing perl-HTTP-Tiny-0:0 100% | 76.4 MiB/s | 156.4 KiB | 00m00s [ 83/303] Installing perl-Term-Cap-0:1. 100% | 29.9 MiB/s | 30.6 KiB | 00m00s [ 84/303] Installing perl-File-Temp-1:0 100% | 80.1 MiB/s | 164.1 KiB | 00m00s [ 85/303] Installing perl-IPC-Open3-0:1 100% | 22.7 MiB/s | 23.3 KiB | 00m00s [ 86/303] Installing perl-POSIX-0:2.20- 100% | 113.4 MiB/s | 232.3 KiB | 00m00s [ 87/303] Installing perl-Term-ANSIColo 100% | 48.4 MiB/s | 99.2 KiB | 00m00s [ 88/303] Installing perl-Class-Struct- 100% | 25.3 MiB/s | 25.9 KiB | 00m00s [ 89/303] Installing perl-podlators-1:6 100% | 16.5 MiB/s | 321.4 KiB | 00m00s [ 90/303] Installing perl-Pod-Perldoc-0 100% | 9.7 MiB/s | 169.2 KiB | 00m00s [ 91/303] Installing perl-File-stat-0:1 100% | 12.7 MiB/s | 13.1 KiB | 00m00s [ 92/303] Installing perl-Symbol-0:1.09 100% | 0.0 B/s | 7.2 KiB | 00m00s [ 93/303] Installing perl-SelectSaver-0 100% | 0.0 B/s | 2.6 KiB | 00m00s [ 94/303] Installing perl-Socket-4:2.03 100% | 59.6 MiB/s | 122.0 KiB | 00m00s [ 95/303] Installing perl-Pod-Usage-4:2 100% | 4.8 MiB/s | 87.9 KiB | 00m00s [ 96/303] Installing perl-overloading-0 100% | 5.4 MiB/s | 5.5 KiB | 00m00s [ 97/303] Installing perl-IO-0:1.55-517 100% | 49.2 MiB/s | 151.3 KiB | 00m00s [ 98/303] Installing perl-mro-0:1.29-51 100% | 41.6 MiB/s | 42.6 KiB | 00m00s [ 99/303] Installing perl-base-0:2.27-5 100% | 0.0 B/s | 12.9 KiB | 00m00s [100/303] Installing perl-Text-ParseWor 100% | 14.2 MiB/s | 14.6 KiB | 00m00s [101/303] Installing perl-Fcntl-0:1.18- 100% | 48.8 MiB/s | 50.0 KiB | 00m00s [102/303] Installing perl-Getopt-Long-1 100% | 71.9 MiB/s | 147.2 KiB | 00m00s [103/303] Installing perl-vars-0:1.05-5 100% | 0.0 B/s | 4.3 KiB | 00m00s [104/303] Installing perl-parent-1:0.24 100% | 10.7 MiB/s | 11.0 KiB | 00m00s [105/303] Installing perl-overload-0:1. 100% | 70.3 MiB/s | 71.9 KiB | 00m00s [106/303] Installing perl-Storable-1:3. 100% | 76.1 MiB/s | 233.9 KiB | 00m00s [107/303] Installing perl-constant-0:1. 100% | 26.7 MiB/s | 27.4 KiB | 00m00s [108/303] Installing perl-MIME-Base64-0 100% | 21.6 MiB/s | 44.3 KiB | 00m00s [109/303] Installing perl-Errno-0:1.38- 100% | 0.0 B/s | 8.7 KiB | 00m00s [110/303] Installing perl-File-Basename 100% | 14.2 MiB/s | 14.6 KiB | 00m00s [111/303] Installing perl-Scalar-List-U 100% | 36.3 MiB/s | 148.5 KiB | 00m00s [112/303] Installing perl-Getopt-Std-0: 100% | 11.5 MiB/s | 11.7 KiB | 00m00s [113/303] Installing perl-Encode-4:3.21 100% | 126.9 MiB/s | 4.7 MiB | 00m00s [114/303] Installing perl-DynaLoader-0: 100% | 31.7 MiB/s | 32.5 KiB | 00m00s [115/303] Installing perl-PathTools-0:3 100% | 45.1 MiB/s | 184.5 KiB | 00m00s [116/303] Installing perl-Exporter-0:5. 100% | 54.3 MiB/s | 55.6 KiB | 00m00s [117/303] Installing perl-Carp-0:1.54-5 100% | 15.5 MiB/s | 47.7 KiB | 00m00s [118/303] Installing perl-libs-4:5.40.2 100% | 128.4 MiB/s | 9.9 MiB | 00m00s [119/303] Installing perl-interpreter-4 100% | 6.9 MiB/s | 119.9 KiB | 00m00s [120/303] Installing perl-File-Find-0:1 100% | 41.5 MiB/s | 42.5 KiB | 00m00s [121/303] Installing perl-version-9:0.9 100% | 42.8 MiB/s | 131.5 KiB | 00m00s [122/303] Installing perl-File-Copy-0:2 100% | 19.7 MiB/s | 20.2 KiB | 00m00s [123/303] Installing perl-ExtUtils-Mani 100% | 84.3 MiB/s | 86.3 KiB | 00m00s [124/303] Installing perl-lib-0:0.65-51 100% | 8.7 MiB/s | 8.9 KiB | 00m00s [125/303] Installing perl-threads-1:2.4 100% | 57.2 MiB/s | 117.1 KiB | 00m00s [126/303] Installing perl-threads-share 100% | 41.9 MiB/s | 85.9 KiB | 00m00s [127/303] Installing perl-Compress-Raw- 100% | 53.9 MiB/s | 165.5 KiB | 00m00s [128/303] Installing perl-File-Compare- 100% | 6.0 MiB/s | 6.1 KiB | 00m00s [129/303] Installing perl-Time-HiRes-4: 100% | 38.3 MiB/s | 117.8 KiB | 00m00s [130/303] Installing perl-CPAN-Meta-Req 100% | 40.7 MiB/s | 83.4 KiB | 00m00s [131/303] Installing perl-Module-CoreLi 100% | 243.5 MiB/s | 1.2 MiB | 00m00s [132/303] Installing perl-Module-Metada 100% | 67.4 MiB/s | 69.0 KiB | 00m00s [133/303] Installing perl-Digest-SHA-1: 100% | 6.2 MiB/s | 115.0 KiB | 00m00s [134/303] Installing perl-Filter-2:1.64 100% | 27.0 MiB/s | 166.2 KiB | 00m00s [135/303] Installing perl-Module-Load-1 100% | 15.5 MiB/s | 15.9 KiB | 00m00s [136/303] Installing perl-Perl-OSType-0 100% | 16.7 MiB/s | 34.3 KiB | 00m00s [137/303] Installing perl-Term-ReadLine 100% | 17.4 MiB/s | 17.8 KiB | 00m00s [138/303] Installing perl-Tie-0:4.6-517 100% | 32.9 MiB/s | 33.7 KiB | 00m00s [139/303] Installing perl-Unicode-Norma 100% | 152.1 MiB/s | 467.4 KiB | 00m00s [140/303] Installing perl-meta-notation 100% | 2.2 MiB/s | 2.3 KiB | 00m00s [141/303] Installing perl-encoding-4:3. 100% | 73.5 MiB/s | 150.4 KiB | 00m00s [142/303] Installing perl-Net-Ping-0:2. 100% | 132.2 MiB/s | 135.3 KiB | 00m00s [143/303] Installing perl-ExtUtils-Comm 100% | 9.9 MiB/s | 10.2 KiB | 00m00s [144/303] Installing perl-Pod-Html-0:1. 100% | 2.7 MiB/s | 43.8 KiB | 00m00s [145/303] Installing perl-File-Which-0: 100% | 30.7 MiB/s | 31.4 KiB | 00m00s [146/303] Installing perl-AutoSplit-0:5 100% | 23.0 MiB/s | 23.5 KiB | 00m00s [147/303] Installing perl-Benchmark-0:1 100% | 35.9 MiB/s | 36.7 KiB | 00m00s [148/303] Installing perl-Test-Harness- 100% | 20.3 MiB/s | 583.4 KiB | 00m00s [149/303] Installing perl-CPAN-Meta-YAM 100% | 52.3 MiB/s | 53.5 KiB | 00m00s [150/303] Installing perl-Compress-Raw- 100% | 6.2 MiB/s | 69.6 KiB | 00m00s [151/303] Installing perl-IO-Compress-0 100% | 41.2 MiB/s | 1.0 MiB | 00m00s [152/303] Installing perl-IO-Zlib-1:1.1 100% | 26.1 MiB/s | 26.7 KiB | 00m00s [153/303] Installing perl-Devel-PPPort- 100% | 218.4 MiB/s | 894.5 KiB | 00m00s [154/303] Installing perl-DirHandle-0:1 100% | 0.0 B/s | 3.8 KiB | 00m00s [155/303] Installing perl-Dumpvalue-0:2 100% | 19.7 MiB/s | 20.2 KiB | 00m00s [156/303] Installing perl-ExtUtils-Cons 100% | 85.5 MiB/s | 87.6 KiB | 00m00s [157/303] Installing perl-ExtUtils-MM-U 100% | 3.6 MiB/s | 3.7 KiB | 00m00s [158/303] Installing perl-Hash-Util-Fie 100% | 31.4 MiB/s | 64.3 KiB | 00m00s [159/303] Installing perl-Hash-Util-0:0 100% | 55.0 MiB/s | 56.4 KiB | 00m00s [160/303] Installing perl-fields-0:2.27 100% | 12.0 MiB/s | 12.2 KiB | 00m00s [161/303] Installing perl-ExtUtils-Pars 100% | 22.7 MiB/s | 489.0 KiB | 00m00s [162/303] Installing perl-ExtUtils-Make 100% | 34.9 MiB/s | 750.3 KiB | 00m00s [163/303] Installing perl-ExtUtils-Inst 100% | 42.6 MiB/s | 87.2 KiB | 00m00s [164/303] Installing perl-devel-4:5.40. 100% | 196.4 MiB/s | 8.1 MiB | 00m00s [165/303] Installing perl-ExtUtils-Embe 100% | 15.7 MiB/s | 16.1 KiB | 00m00s [166/303] Installing perl-I18N-LangTags 100% | 81.6 MiB/s | 83.6 KiB | 00m00s [167/303] Installing perl-Locale-Makete 100% | 84.9 MiB/s | 173.9 KiB | 00m00s [168/303] Installing perl-Locale-Makete 100% | 13.1 MiB/s | 13.5 KiB | 00m00s [169/303] Installing perl-Params-Check- 100% | 27.9 MiB/s | 28.6 KiB | 00m00s [170/303] Installing perl-Module-Load-C 100% | 29.2 MiB/s | 29.9 KiB | 00m00s [171/303] Installing perl-IPC-Cmd-2:1.0 100% | 83.9 MiB/s | 85.9 KiB | 00m00s [172/303] Installing perl-ExtUtils-CBui 100% | 33.1 MiB/s | 101.7 KiB | 00m00s [173/303] Installing perl-Math-Complex- 100% | 83.8 MiB/s | 85.8 KiB | 00m00s [174/303] Installing perl-Math-BigInt-1 100% | 212.8 MiB/s | 1.1 MiB | 00m00s [175/303] Installing perl-JSON-PP-1:4.1 100% | 8.2 MiB/s | 143.6 KiB | 00m00s [176/303] Installing perl-CPAN-Meta-0:2 100% | 54.5 MiB/s | 613.8 KiB | 00m00s [177/303] Installing perl-NDBM_File-0:1 100% | 28.9 MiB/s | 29.6 KiB | 00m00s [178/303] Installing perl-SelfLoader-0: 100% | 0.0 B/s | 22.8 KiB | 00m00s [179/303] Installing perl-Sys-Hostname- 100% | 16.8 MiB/s | 17.2 KiB | 00m00s [180/303] Installing perl-Term-Table-0: 100% | 39.6 MiB/s | 81.1 KiB | 00m00s [181/303] Installing perl-Text-Balanced 100% | 110.1 MiB/s | 112.7 KiB | 00m00s [182/303] Installing perl-Tie-RefHash-0 100% | 36.5 MiB/s | 37.4 KiB | 00m00s [183/303] Installing perl-User-pwent-0: 100% | 17.4 MiB/s | 17.9 KiB | 00m00s [184/303] Installing perl-autouse-0:1.1 100% | 6.2 MiB/s | 6.3 KiB | 00m00s [185/303] Installing perl-subs-0:1.04-5 100% | 2.4 MiB/s | 2.5 KiB | 00m00s [186/303] Installing perl-Opcode-0:1.65 100% | 48.7 MiB/s | 49.9 KiB | 00m00s [187/303] Installing perl-Safe-0:2.46-5 100% | 30.3 MiB/s | 31.0 KiB | 00m00s [188/303] Installing perl-Params-Util-0 100% | 29.8 MiB/s | 61.0 KiB | 00m00s [189/303] Installing perl-Sub-Install-0 100% | 36.3 MiB/s | 37.2 KiB | 00m00s [190/303] Installing perl-Data-OptList- 100% | 51.0 MiB/s | 52.2 KiB | 00m00s [191/303] Installing perl-Filter-Simple 100% | 16.8 MiB/s | 51.7 KiB | 00m00s [192/303] Installing perl-Test-Simple-3 100% | 59.0 MiB/s | 1.8 MiB | 00m00s [193/303] Installing perl-Devel-SelfStu 100% | 7.1 MiB/s | 7.3 KiB | 00m00s [194/303] Installing perl-Memoize-0:1.1 100% | 65.0 MiB/s | 66.5 KiB | 00m00s [195/303] Installing perl-Math-BigInt-F 100% | 22.9 MiB/s | 46.9 KiB | 00m00s [196/303] Installing perl-bignum-0:0.67 100% | 44.4 MiB/s | 136.5 KiB | 00m00s [197/303] Installing perl-File-Fetch-0: 100% | 59.9 MiB/s | 61.3 KiB | 00m00s [198/303] Installing perl-ExtUtils-Mini 100% | 8.6 MiB/s | 8.8 KiB | 00m00s [199/303] Installing perl-inc-latest-2: 100% | 17.7 MiB/s | 36.3 KiB | 00m00s [200/303] Installing perl-libnetcfg-4:5 100% | 1.1 MiB/s | 17.3 KiB | 00m00s [201/303] Installing perl-DBM_Filter-0: 100% | 29.8 MiB/s | 30.5 KiB | 00m00s [202/303] Installing perl-File-HomeDir- 100% | 60.5 MiB/s | 123.8 KiB | 00m00s [203/303] Installing perl-open-0:1.13-5 100% | 0.0 B/s | 11.7 KiB | 00m00s [204/303] Installing perl-debugger-0:1. 100% | 196.9 MiB/s | 403.3 KiB | 00m00s [205/303] Installing perl-sigtrap-0:1.1 100% | 11.2 MiB/s | 11.4 KiB | 00m00s [206/303] Installing perl-Unicode-Colla 100% | 199.8 MiB/s | 4.2 MiB | 00m00s [207/303] Installing perl-Unicode-UCD-0 100% | 100.1 MiB/s | 205.0 KiB | 00m00s [208/303] Installing perl-Env-0:1.06-51 100% | 26.6 MiB/s | 27.2 KiB | 00m00s [209/303] Installing perl-Module-CoreLi 100% | 1.2 MiB/s | 19.3 KiB | 00m00s [210/303] Installing perl-Archive-Zip-0 100% | 15.3 MiB/s | 297.8 KiB | 00m00s [211/303] Installing perl-Thread-0:3.05 100% | 12.2 MiB/s | 12.5 KiB | 00m00s [212/303] Installing perl-Thread-Queue- 100% | 29.7 MiB/s | 30.4 KiB | 00m00s [213/303] Installing perl-Thread-Semaph 100% | 10.3 MiB/s | 10.6 KiB | 00m00s [214/303] Installing perl-experimental- 100% | 41.9 MiB/s | 42.9 KiB | 00m00s [215/303] Installing perl-Encode-devel- 100% | 6.2 MiB/s | 101.1 KiB | 00m00s [216/303] Installing perl-Pod-Checker-4 100% | 3.1 MiB/s | 53.5 KiB | 00m00s [217/303] Installing perl-diagnostics-0 100% | 28.5 MiB/s | 466.5 KiB | 00m00s [218/303] Installing perl-macros-4:5.40 100% | 0.0 B/s | 5.8 KiB | 00m00s [219/303] Installing perl-utils-0:5.40. 100% | 6.0 MiB/s | 98.5 KiB | 00m00s [220/303] Installing perl-Attribute-Han 100% | 39.5 MiB/s | 40.5 KiB | 00m00s [221/303] Installing perl-Config-Extens 100% | 3.1 MiB/s | 3.2 KiB | 00m00s [222/303] Installing perl-Config-Perl-V 100% | 13.4 MiB/s | 27.5 KiB | 00m00s [223/303] Installing perl-Devel-Peek-0: 100% | 43.8 MiB/s | 44.9 KiB | 00m00s [224/303] Installing perl-English-0:1.1 100% | 6.5 MiB/s | 6.6 KiB | 00m00s [225/303] Installing perl-File-DosGlob- 100% | 21.7 MiB/s | 22.2 KiB | 00m00s [226/303] Installing perl-FileCache-0:1 100% | 7.7 MiB/s | 7.9 KiB | 00m00s [227/303] Installing perl-FindBin-0:1.5 100% | 0.0 B/s | 7.1 KiB | 00m00s [228/303] Installing perl-GDBM_File-1:1 100% | 78.8 MiB/s | 80.7 KiB | 00m00s [229/303] Installing perl-I18N-Collate- 100% | 0.0 B/s | 7.6 KiB | 00m00s [230/303] Installing perl-I18N-Langinfo 100% | 35.3 MiB/s | 36.1 KiB | 00m00s [231/303] Installing perl-IPC-SysV-0:2. 100% | 37.4 MiB/s | 76.7 KiB | 00m00s [232/303] Installing perl-Module-Loaded 100% | 0.0 B/s | 5.5 KiB | 00m00s [233/303] Installing perl-NEXT-0:0.69-5 100% | 0.0 B/s | 23.9 KiB | 00m00s [234/303] Installing perl-Net-0:1.04-51 100% | 23.2 MiB/s | 23.7 KiB | 00m00s [235/303] Installing perl-ODBM_File-0:1 100% | 28.8 MiB/s | 29.4 KiB | 00m00s [236/303] Installing perl-PerlIO-via-Qu 100% | 31.4 MiB/s | 32.1 KiB | 00m00s [237/303] Installing perl-Pod-Functions 100% | 14.3 MiB/s | 14.6 KiB | 00m00s [238/303] Installing perl-Search-Dict-0 100% | 5.1 MiB/s | 5.2 KiB | 00m00s [239/303] Installing perl-Sys-Syslog-0: 100% | 47.3 MiB/s | 96.9 KiB | 00m00s [240/303] Installing perl-Term-Complete 100% | 0.0 B/s | 6.3 KiB | 00m00s [241/303] Installing perl-Test-0:1.31-5 100% | 0.0 B/s | 37.4 KiB | 00m00s [242/303] Installing perl-Text-Abbrev-0 100% | 3.5 MiB/s | 3.6 KiB | 00m00s [243/303] Installing perl-Tie-File-0:1. 100% | 84.2 MiB/s | 86.2 KiB | 00m00s [244/303] Installing perl-Tie-Memoize-0 100% | 0.0 B/s | 6.7 KiB | 00m00s [245/303] Installing perl-Time-0:1.04-5 100% | 10.5 MiB/s | 10.8 KiB | 00m00s [246/303] Installing perl-Time-Piece-0: 100% | 35.5 MiB/s | 72.7 KiB | 00m00s [247/303] Installing perl-blib-0:1.07-5 100% | 3.5 MiB/s | 3.6 KiB | 00m00s [248/303] Installing perl-deprecate-0:0 100% | 3.4 MiB/s | 6.9 KiB | 00m00s [249/303] Installing perl-doc-0:5.40.2- 100% | 221.7 MiB/s | 11.1 MiB | 00m00s [250/303] Installing perl-encoding-warn 100% | 0.0 B/s | 10.7 KiB | 00m00s [251/303] Installing perl-filetest-0:1. 100% | 0.0 B/s | 6.8 KiB | 00m00s [252/303] Installing perl-less-0:0.03-5 100% | 0.0 B/s | 5.3 KiB | 00m00s [253/303] Installing perl-perlfaq-0:5.2 100% | 179.9 MiB/s | 736.9 KiB | 00m00s [254/303] Installing perl-ph-0:5.40.2-5 100% | 89.8 MiB/s | 275.9 KiB | 00m00s [255/303] Installing perl-sort-0:2.05-5 100% | 0.0 B/s | 5.2 KiB | 00m00s [256/303] Installing perl-vmsish-0:1.04 100% | 6.8 MiB/s | 6.9 KiB | 00m00s [257/303] Installing perl-Compress-Bzip 100% | 47.3 MiB/s | 145.3 KiB | 00m00s [258/303] Installing perl-Devel-Size-0: 100% | 42.8 MiB/s | 43.8 KiB | 00m00s [259/303] Installing perl-Text-Glob-0:0 100% | 9.1 MiB/s | 9.3 KiB | 00m00s [260/303] Installing perl-local-lib-0:2 100% | 58.8 MiB/s | 120.4 KiB | 00m00s [261/303] Installing perl-IPC-System-Si 100% | 71.8 MiB/s | 73.5 KiB | 00m00s [262/303] Installing perl-autodie-0:2.3 100% | 53.5 MiB/s | 219.1 KiB | 00m00s [263/303] Installing perl-Compress-Raw- 100% | 60.2 MiB/s | 123.3 KiB | 00m00s [264/303] Installing perl-IO-Compress-L 100% | 53.8 MiB/s | 220.4 KiB | 00m00s [265/303] Installing perl-Algorithm-Dif 100% | 53.5 MiB/s | 109.5 KiB | 00m00s [266/303] Installing perl-Text-Diff-0:1 100% | 41.5 MiB/s | 85.1 KiB | 00m00s [267/303] Installing perl-Archive-Tar-0 100% | 9.0 MiB/s | 156.9 KiB | 00m00s [268/303] Installing perl-Module-Signat 100% | 7.7 MiB/s | 141.8 KiB | 00m00s [269/303] Installing perl-Text-Template 100% | 55.7 MiB/s | 114.0 KiB | 00m00s [270/303] Installing perl-MRO-Compat-0: 100% | 21.9 MiB/s | 44.9 KiB | 00m00s [271/303] Installing perl-Package-Gener 100% | 30.8 MiB/s | 31.5 KiB | 00m00s [272/303] Installing perl-Sub-Exporter- 100% | 65.7 MiB/s | 201.9 KiB | 00m00s [273/303] Installing perl-Data-Section- 100% | 43.0 MiB/s | 44.1 KiB | 00m00s [274/303] Installing perl-Software-Lice 100% | 83.5 MiB/s | 513.1 KiB | 00m00s [275/303] Installing perl-Module-Build- 100% | 29.4 MiB/s | 663.2 KiB | 00m00s [276/303] Installing perl-TermReadKey-0 100% | 21.5 MiB/s | 66.2 KiB | 00m00s [277/303] Installing perl-Error-1:0.170 100% | 39.0 MiB/s | 80.0 KiB | 00m00s [278/303] Installing git-0:2.49.0-2.fc4 100% | 85.4 MiB/s | 87.5 KiB | 00m00s [279/303] Installing perl-Git-0:2.49.0- 100% | 63.5 MiB/s | 65.0 KiB | 00m00s [280/303] Installing rocm-clang-0:19-10 100% | 62.0 MiB/s | 70.2 MiB | 00m01s [281/303] Installing rocm-clang-devel-0 100% | 81.2 MiB/s | 23.5 MiB | 00m00s [282/303] Installing rocm-device-libs-0 100% | 68.4 MiB/s | 3.2 MiB | 00m00s [283/303] Installing rocm-comgr-devel-0 100% | 48.6 MiB/s | 99.6 KiB | 00m00s [284/303] Installing hipcc-0:19-10.rocm 100% | 23.7 MiB/s | 654.3 KiB | 00m00s [285/303] Installing rocm-hip-0:6.4.1-2 100% | 274.1 MiB/s | 24.9 MiB | 00m00s [286/303] Installing libdb-0:5.3.28-65. 100% | 231.8 MiB/s | 1.9 MiB | 00m00s [287/303] Installing perl-DB_File-0:1.8 100% | 93.1 MiB/s | 190.6 KiB | 00m00s [288/303] Installing perl-CPAN-0:2.38-4 100% | 65.4 MiB/s | 1.9 MiB | 00m00s [289/303] Installing perl-4:5.40.2-517. 100% | 121.1 KiB/s | 124.0 B | 00m00s [290/303] Installing emacs-filesystem-1 100% | 531.2 KiB/s | 544.0 B | 00m00s [291/303] Installing rhash-0:1.4.5-2.fc 100% | 18.3 MiB/s | 356.4 KiB | 00m00s [292/303] Installing libuv-1:1.51.0-1.f 100% | 139.9 MiB/s | 573.0 KiB | 00m00s [293/303] Installing jsoncpp-0:1.9.6-1. 100% | 128.5 MiB/s | 263.1 KiB | 00m00s [294/303] Installing cmake-0:3.31.6-3.f 100% | 218.4 MiB/s | 34.5 MiB | 00m00s [295/303] Installing cmake-data-0:3.31. 100% | 44.7 MiB/s | 9.1 MiB | 00m00s [296/303] Installing rocm-cmake-0:6.4.0 100% | 44.1 MiB/s | 135.6 KiB | 00m00s [297/303] Installing hipify-0:6.4.1-2.f 100% | 110.3 MiB/s | 3.1 MiB | 00m00s [298/303] Installing rocm-hip-devel-0:6 100% | 98.9 MiB/s | 2.8 MiB | 00m00s [299/303] Installing rocm-rpm-macros-0: 100% | 19.0 MiB/s | 19.5 KiB | 00m00s [300/303] Installing rocm-smi-devel-0:6 100% | 138.7 MiB/s | 284.0 KiB | 00m00s [301/303] Installing rocm-core-devel-0: 100% | 15.8 MiB/s | 16.1 KiB | 00m00s [302/303] Installing annobin-plugin-gcc 100% | 36.0 MiB/s | 995.3 KiB | 00m00s [303/303] Installing gcc-plugin-annobin 100% | 200.6 KiB/s | 58.8 KiB | 00m00s Warning: skipped OpenPGP checks for 29 packages from repository: copr_base Complete! Finish: build setup for rccl-6.4.1-3.fc43.src.rpm Start: rpmbuild rccl-6.4.1-3.fc43.src.rpm Building target platforms: x86_64 Building for target x86_64 setting SOURCE_DATE_EPOCH=1750118400 Executing(%mkbuilddir): /bin/sh -e /var/tmp/rpm-tmp.2X04xj Executing(%prep): /bin/sh -e /var/tmp/rpm-tmp.bZi3Mg + umask 022 + cd /builddir/build/BUILD/rccl-6.4.1-build + cd /builddir/build/BUILD/rccl-6.4.1-build + rm -rf rccl-rocm-6.4.1 + /usr/lib/rpm/rpmuncompress -x /builddir/build/SOURCES/RCCL-6.4.1.tar.gz + STATUS=0 + '[' 0 -ne 0 ']' + cd rccl-rocm-6.4.1 + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w . + sed -i -e '/AMD GPU targets to compile for/d' CMakeLists.txt + sed -i -e 's@cat ${ROCM_PATH}/.info/version@echo 6.4.1@' CMakeLists.txt + sed -i -e s@rocm-core/rocm_version.h@rocm_version.h@ src/include/hip_rocm_version_info.h + sed -i -e 's@if (ENABLE_MSCCLPP AND NOT(${HOST_OS_ID} STREQUAL "ubuntu" OR ${HOST_OS_ID} STREQUAL "centos"))@if (ENABLE_MSCCLPP)@' CMakeLists.txt + sed -i '/#include ' test/common/TestBed.hpp + RPM_EC=0 ++ jobs -p + exit 0 Executing(%build): /bin/sh -e /var/tmp/rpm-tmp.Z6pgcY + umask 022 + cd /builddir/build/BUILD/rccl-6.4.1-build + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.4.1 + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + /usr/bin/cmake -S . -B redhat-linux-build -DCMAKE_C_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_CXX_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_Fortran_FLAGS_RELEASE:STRING=-DNDEBUG -DCMAKE_VERBOSE_MAKEFILE:BOOL=ON -DCMAKE_INSTALL_DO_STRIP:BOOL=OFF -DCMAKE_INSTALL_PREFIX:PATH=/usr -DCMAKE_INSTALL_FULL_SBINDIR:PATH=/usr/bin -DCMAKE_INSTALL_SBINDIR:PATH=bin -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_SUFFIX=64 -DBUILD_SHARED_LIBS:BOOL=ON '-DAMDGPU_TARGETS=gfx90a:xnack+;gfx90a:xnack-;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201' -DBUILD_FILE_REORG_BACKWARD_COMPATIBILITY=OFF -DBUILD_TESTS=OFF -DCMAKE_BUILD_TYPE=RelWithDebInfo -DCMAKE_C_COMPILER=/usr/bin/hipcc -DCMAKE_CXX_COMPILER=/usr/bin/hipcc -DCMAKE_EXPORT_COMPILE_COMMANDS=OFF -DCMAKE_INSTALL_LIBDIR=/usr/lib64 -DCMAKE_SKIP_RPATH=ON -DENABLE_MSCCLPP=OFF -DHIP_PLATFORM=amd -DRCCL_ROCPROFILER_REGISTER=OFF -DROCM_PATH=/usr -DROCM_SYMLINK_LIBS=OFF CMake Deprecation Warning at CMakeLists.txt:6 (cmake_minimum_required): Compatibility with CMake < 3.10 will be removed from a future version of CMake. Update the VERSION argument value. Or, use the ... syntax to tell CMake that the project requires at least but has been updated to work with policies introduced by or earlier. -- CMAKE_TOOLCHAIN_FILE: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/toolchain-linux.cmake -- The CXX compiler identification is Clang 19.0.0 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/hipcc - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") CMake Deprecation Warning at /usr/share/rocm/cmake/ROCMConfig.cmake:12 (message): Use of find_package(ROCM) is deprecated as of ROCm 6.4. Please use find_package(ROCmCMakeBuildTools) Call Stack (most recent call first): cmake/Dependencies.cmake:75 (find_package) CMakeLists.txt:55 (include) -- Checking for ROCm support for GPU targets: gfx906;gfx908;gfx90a;gfx942;gfx1030;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201 -- Performing Test COMPILER_HAS_TARGET_ID_gfx906 -- Performing Test COMPILER_HAS_TARGET_ID_gfx906 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx908 -- Performing Test COMPILER_HAS_TARGET_ID_gfx908 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1200 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1200 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1201 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1201 - Success -- Compiling for gfx906;gfx908;gfx90a;gfx942;gfx1030;gfx1100;gfx1101;gfx1102;gfx1200;gfx1201 -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") CMake Deprecation Warning at /usr/share/rocm/cmake/ROCMConfig.cmake:12 (message): Use of find_package(ROCM) is deprecated as of ROCm 6.4. Please use find_package(ROCmCMakeBuildTools) Call Stack (most recent call first): cmake/Dependencies.cmake:75 (find_package) CMakeLists.txt:102 (include) -- ROCM_PATH found: /usr -- Compiling with hipcc -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP compiler: clang -- HIP runtime: rocclr -- hipcc executable: /usr/bin/hipcc sh: line 1: /usr/bin/rocm_agent_enumerator: No such file or directory -- hipcc version: 6.4.43483 -- hipconfig executable: /usr/bin/hipconfig -- hipcc HIP version: 6.4.43483 -- ROCm version: 6.4.1 -- Looking for hipDeviceMallocUncached -- Looking for hipDeviceMallocUncached - found -- Looking for hipDeviceMallocContiguous -- Looking for hipDeviceMallocContiguous - found -- RCCL LL128 protocol enabled -- HSA runtime: /usr/include -- Found rocm_smi at /usr/include -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found -- RSMI_INIT_FLAG_THRAD_ONLY_MUTEX supported -- Performing Test HAVE_KERNARG_PRELOAD -- Performing Test HAVE_KERNARG_PRELOAD - Success -- Kernarg preloading to SGPR enabled -- Performing Test HAVE_PARALLEL_JOBS -- Performing Test HAVE_PARALLEL_JOBS - Success -- Parallel jobs enabled CMake Warning at CMakeLists.txt:331 (message): ROCTX library not found. Skipping ROCTX linking. -- Found Python3: /usr/bin/python3.14 (found version "3.14.0") found components: Interpreter -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.h -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp -- Generating /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp -- HIP_CONTIGUOUS_MEMORY enabled -- HIP_UNCACHED_MEMORY enabled -- Use 1 jobs for linking -- Building shared RCCL library -- rocm-cmake: Set license file to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/LICENSE.txt. -- Configuring done (44.5s) -- Generating done (0.1s) CMake Warning: Manually-specified variables were not used by the project: AMDGPU_TARGETS CMAKE_CXX_FLAGS_RELEASE CMAKE_C_FLAGS_RELEASE CMAKE_Fortran_FLAGS_RELEASE CMAKE_INSTALL_DO_STRIP LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build + /usr/bin/cmake --build redhat-linux-build -j2 --verbose Change Dir: '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j2 /usr/bin/cmake -S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1 -B/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/CMakeFiles /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build//CMakeFiles/progress.marks /usr/bin/gmake -f CMakeFiles/Makefile2 all gmake[1]: Entering directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' cd /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' [ 0%] Built target git_version_check /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' [ 0%] Hipifying src/transport/shm.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport/shm.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc [ 0%] Hipifying src/bootstrap.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/bootstrap.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc [ 0%] Hipifying src/channel.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/channel.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc [ 0%] Hipifying src/collectives.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/collectives.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc [ 1%] Hipifying src/debug.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/debug.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc [ 1%] Hipifying src/device/all_gather.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/all_gather.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h [ 1%] Hipifying src/device/all_reduce.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/all_reduce.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h [ 2%] Hipifying src/device/alltoall_pivot.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/alltoall_pivot.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h [ 2%] Hipifying src/device/broadcast.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/broadcast.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h [ 2%] Hipifying src/device/common.cu -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/common.cu -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h [ 2%] Hipifying src/device/common.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/common.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h [ 2%] Hipifying src/device/common_kernel.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/common_kernel.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common_kernel.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h [ 2%] Hipifying src/device/msccl_kernel_impl.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/msccl_kernel_impl.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common_kernel.h [ 3%] Hipifying src/device/network/unpack/unpack.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack/unpack.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/network/unpack/unpack.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack/unpack.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack/unpack.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack/unpack.h [ 3%] Hipifying src/device/network/unpack/unpack_defs.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/network/unpack/unpack_defs.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h [ 3%] Hipifying src/device/onerank.cu -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/onerank.cu -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack/unpack_defs.h [ 4%] Hipifying src/device/op128.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/op128.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/op128.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/op128.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/op128.h [ 4%] Hipifying src/device/primitives.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/primitives.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/op128.h [ 4%] Hipifying src/device/prims_ll.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/prims_ll.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h [ 4%] Hipifying src/device/prims_ll128.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/prims_ll128.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h [ 5%] Hipifying src/device/prims_simple.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/prims_simple.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h [ 5%] Hipifying src/device/reduce.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/reduce.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h [ 5%] Hipifying src/device/reduce_kernel.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/reduce_kernel.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_kernel.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h [ 5%] Hipifying src/device/reduce_scatter.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/reduce_scatter.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_kernel.h [ 6%] Hipifying src/device/sendrecv.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/device/sendrecv.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h [ 6%] Hipifying src/enqueue.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/enqueue.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc Added COLL_UNROLL template argument to /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h [ 6%] Hipifying src/graph/connect.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/connect.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc [ 6%] Hipifying src/graph/paths.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/paths.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc [ 6%] Hipifying src/graph/rings.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/rings.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc [ 7%] Hipifying src/graph/rings.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/rings.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.h [ 7%] Hipifying src/graph/rome_models.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/rome_models.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc [ 7%] Hipifying src/graph/rome_models.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/rome_models.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.h [ 7%] Hipifying src/graph/search.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/search.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc [ 8%] Hipifying src/graph/topo.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/topo.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc [ 8%] Hipifying src/graph/topo.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h [ 8%] Hipifying src/graph/trees.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/trees.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/topo.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/trees.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/trees.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/trees.cc [ 8%] Hipifying src/graph/tuning.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/tuning.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc [ 9%] Hipifying src/graph/xml.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/xml.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc [ 9%] Hipifying src/graph/xml.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/graph/xml.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h [ 9%] Hipifying src/group.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/group.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc [ 9%] Hipifying src/include/BfdBacktrace.hpp -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/BfdBacktrace.hpp -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/BfdBacktrace.hpp [ 9%] Hipifying src/include/alloc.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/alloc.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h [ 9%] Hipifying src/include/alt_rsmi.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alt_rsmi.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/alt_rsmi.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alt_rsmi.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alt_rsmi.h [ 9%] Hipifying src/include/api_trace.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/api_trace.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/api_trace.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/api_trace.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/api_trace.h [ 10%] Hipifying src/include/archinfo.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/archinfo.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/archinfo.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/archinfo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/archinfo.h [ 10%] Hipifying src/include/argcheck.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/argcheck.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h [ 11%] Hipifying src/include/bitops.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bitops.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/bitops.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bitops.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bitops.h [ 11%] Hipifying src/include/bootstrap.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/bootstrap.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h [ 11%] Hipifying src/include/channel.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/channel.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h [ 11%] Hipifying src/include/checks.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/checks.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/checks.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/checks.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/checks.h [ 11%] Hipifying src/include/coll_net.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/coll_net.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h [ 12%] Hipifying src/include/collectives.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/collectives.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h [ 12%] Hipifying src/include/comm.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/comm.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h [ 12%] Hipifying src/include/core.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/core.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h [ 13%] Hipifying src/include/cpuset.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/cpuset.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/cpuset.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/cpuset.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/cpuset.h [ 13%] Hipifying src/include/debug.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/debug.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/debug.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/debug.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/debug.h [ 13%] Hipifying src/include/device.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/device.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h [ 13%] Hipifying src/include/enqueue.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/enqueue.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h [ 14%] Hipifying src/include/gdrwrap.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/gdrwrap.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h [ 14%] Hipifying src/include/git_version.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/git_version.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/git_version.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/git_version.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/git_version.h [ 14%] Hipifying src/include/graph.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/graph.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h [ 14%] Hipifying src/include/group.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/group.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h [ 15%] Hipifying src/include/hip_rocm_version_info.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/hip_rocm_version_info.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/hip_rocm_version_info.h [ 15%] Hipifying src/include/ibvcore.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvcore.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/ibvcore.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvcore.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvcore.h [ 15%] Hipifying src/include/ibvsymbols.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvsymbols.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/ibvsymbols.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvsymbols.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvsymbols.h [ 15%] Hipifying src/include/ibvwrap.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/ibvwrap.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h [ 16%] Hipifying src/include/info.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/info.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h [ 16%] Hipifying src/include/ipcsocket.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ipcsocket.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/ipcsocket.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ipcsocket.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ipcsocket.h [ 17%] Hipifying src/include/msccl/msccl_kernel.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/msccl/msccl_kernel.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_kernel.h [ 18%] Hipifying src/include/msccl/msccl_lifecycle.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/msccl/msccl_lifecycle.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_lifecycle.h [ 18%] Hipifying src/include/msccl/msccl_parser.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/msccl/msccl_parser.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h [ 18%] Hipifying src/include/msccl/msccl_scheduler.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/msccl/msccl_scheduler.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_scheduler.h [ 18%] Hipifying src/include/msccl/msccl_setup.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/msccl/msccl_setup.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_setup.h [ 19%] Hipifying src/include/msccl/msccl_status.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/msccl/msccl_status.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h [ 19%] Hipifying src/include/msccl/msccl_struct.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/msccl/msccl_struct.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h [ 19%] Hipifying src/include/nccl_common.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_common.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nccl_common.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_common.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_common.h [ 19%] Hipifying src/include/nccl_net.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_net.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nccl_net.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_net.h [ 20%] Hipifying src/include/nccl_tuner.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_tuner.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nccl_tuner.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_tuner.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nccl_tuner.h [ 20%] Hipifying src/include/net.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/net.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h [ 20%] Hipifying src/include/net_device.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net_device.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/net_device.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net_device.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net_device.h [ 20%] Hipifying src/include/npkit/npkit.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/npkit/npkit.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h [ 20%] Hipifying src/include/npkit/npkit_event.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit_event.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/npkit/npkit_event.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit_event.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit_event.h [ 21%] Hipifying src/include/npkit/npkit_struct.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/npkit/npkit_struct.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit_struct.h [ 21%] Hipifying src/include/nvmlwrap.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvmlwrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvmlwrap.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvmlwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvmlwrap.h [ 22%] Hipifying src/include/nvtx.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h [ 22%] Hipifying src/include/nvtx3/nvToolsExt.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExt.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExt.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCounters.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtCounters.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCounters.h [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtCuda.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCuda.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtCudaRt.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtCudaRt.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtMem.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtMem.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMem.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtMemCudaRt.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtMemCudaRt.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtMemCudaRt.h [ 23%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtOpenCL.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtOpenCL.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtPayload.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayload.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtPayloadHelper.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtPayloadHelper.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtPayloadHelper.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtSemanticsCounters.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtSemanticsCounters.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsCounters.h [ 24%] Hipifying src/include/nvtx3/nvToolsExtSemanticsScope.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtSemanticsScope.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSemanticsScope.h [ 25%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvToolsExtSync.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvToolsExtSync.h [ 25%] Hipifying src/include/nvtx3/nvtx3.hpp -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtx3.hpp -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtx3.hpp [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtHelperMacros.h [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImpl.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImpl.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplCounters_v1.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMemCudaRt_v1.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplMem_v1.h [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtImplPayload_v1.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtInit.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtInit.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtInit.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadHelperInternal.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtPayloadTypeInfo.h [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxExtTypes.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxExtTypes.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h [ 28%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h [ 29%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h [ 30%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h [ 30%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h [ 30%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h [ 30%] Hipifying src/include/nvtx_stub.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx_stub.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/nvtx_stub.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx_stub.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx_stub.h [ 30%] Hipifying src/include/p2p.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/p2p.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h [ 30%] Hipifying src/include/param.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/param.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/param.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/param.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/param.h [ 30%] Hipifying src/include/profiler.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/profiler.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h [ 31%] Hipifying src/include/proxy.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/proxy.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h [ 31%] Hipifying src/include/rccl_float8.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/rccl_float8.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h [ 31%] Hipifying src/include/rccl_vars.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_vars.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/rccl_vars.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_vars.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_vars.h [ 31%] Hipifying src/include/register.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/register.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/register.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/register.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/register.h [ 32%] Hipifying src/include/rocm_smi_wrap.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/rocm_smi_wrap.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rocm_smi_wrap.h [ 32%] Hipifying src/include/rocmwrap.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rocmwrap.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/rocmwrap.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rocmwrap.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rocmwrap.h [ 32%] Hipifying src/include/roctx.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/roctx.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h [ 32%] Hipifying src/include/shm.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/shm.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/shm.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/shm.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/shm.h [ 33%] Hipifying src/include/signals.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/signals.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/signals.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/signals.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/signals.h [ 33%] Hipifying src/include/socket.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/socket.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/socket.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/socket.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/socket.h [ 33%] Hipifying src/include/strongstream.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/strongstream.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/strongstream.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/strongstream.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/strongstream.h [ 33%] Hipifying src/include/timer.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/timer.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/timer.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/timer.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/timer.h [ 34%] Hipifying src/include/transport.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/transport.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/transport.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/transport.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/transport.h [ 34%] Hipifying src/include/trees.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/trees.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/trees.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/trees.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/trees.h [ 34%] Hipifying src/include/tuner.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/tuner.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h [ 34%] Hipifying src/include/utils.h -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/include/utils.h -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h [ 34%] Hipifying src/init.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/init.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc [ 35%] Hipifying src/init_nvtx.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/init_nvtx.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc [ 35%] Hipifying src/misc/alt_rsmi.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/alt_rsmi.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc [ 35%] Hipifying src/misc/api_trace.c -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.c mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/api_trace.c -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.c && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.c [ 35%] Hipifying src/misc/api_trace.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/api_trace.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc [ 36%] Hipifying src/misc/archinfo.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/archinfo.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/archinfo.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/archinfo.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/archinfo.cc [ 36%] Hipifying src/misc/argcheck.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc [ 37%] Hipifying src/misc/ibvsymbols.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/argcheck.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/ibvsymbols.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc [ 37%] Hipifying src/misc/ibvwrap.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/ibvwrap.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc [ 37%] Hipifying src/misc/ipcsocket.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/ipcsocket.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc [ 37%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/msccl/msccl_lifecycle.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc [ 38%] Hipifying src/misc/msccl/msccl_parser.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/msccl/msccl_parser.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc [ 38%] Hipifying src/misc/msccl/msccl_setup.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/msccl/msccl_setup.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc [ 38%] Hipifying src/misc/msccl/msccl_status.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/msccl/msccl_status.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc [ 38%] Hipifying src/misc/npkit.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/npkit.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc [ 39%] Hipifying src/misc/nvmlwrap_stub.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/nvmlwrap_stub.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc [ 39%] Hipifying src/misc/param.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/param.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/param.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/param.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/param.cc [ 39%] Hipifying src/misc/profiler.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/profiler.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc [ 39%] Hipifying src/misc/rocm_smi_wrap.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/rocm_smi_wrap.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc [ 40%] Hipifying src/misc/rocmwrap.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocmwrap.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/rocmwrap.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocmwrap.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocmwrap.cc [ 40%] Hipifying src/misc/roctx.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc [ 40%] Hipifying src/misc/shmutils.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/roctx.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/shmutils.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc [ 40%] Hipifying src/misc/signals.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/signals.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/signals.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/signals.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/signals.cc [ 41%] Hipifying src/misc/socket.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/socket.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc [ 41%] Hipifying src/misc/strongstream.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/strongstream.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/strongstream.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/strongstream.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/strongstream.cc [ 41%] Hipifying src/misc/tuner.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/tuner.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc [ 41%] Hipifying src/misc/utils.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/misc/utils.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc [ 41%] Hipifying src/msccl.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/msccl.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc [ 41%] Hipifying src/net.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/net.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc [ 41%] Hipifying src/proxy.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/proxy.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc [ 42%] Hipifying src/register.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/register.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc [ 42%] Hipifying src/transport.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc [ 42%] Hipifying src/transport/coll_net.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport/coll_net.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc [ 43%] Hipifying src/transport/generic.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport/generic.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc [ 43%] Hipifying src/transport/net_ib.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport/net_ib.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc [ 43%] Hipifying src/transport/net_socket.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport/net_socket.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc [ 43%] Hipifying src/transport/net.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport/net.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc [ 44%] Hipifying src/transport/nvls.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport/nvls.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc [ 44%] Hipifying src/transport/p2p.cc -> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/src/transport/p2p.cc -o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc && /usr/bin/cmake -E env bash /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/cmake/scripts/add_unroll.sh /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc cd /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build gmake[2]: Entering directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/channel.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77In file included from | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h: 14 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h: 15 ui: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:nt32_t y, head, mantissa;14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18 : warning: unused variable 'y' [-Wunused-variable] 77 | | ^ uint32_t y, head,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:m9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.hantissa:;14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:In file included from | ^ 14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/bootstrap.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: :9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/bootstrap.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mIn file included from e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ m_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/channel.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx1201. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx942. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/debug.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = {/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, da/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ tatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPaylo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ adSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAll/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ v payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTy/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:p261eSize(datatype), root, datatype}; | ^~~~~~~ :38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * nc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ clTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, da/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ tatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ 23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ vtxParamsSendRecv payload{count * ncclType/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ Size(datatype), peer, dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ atype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | Nyvpe}; | ^~~~~~~ txParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ :23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * {conunt * nccclTypeSizce(datatyple), op, Tdatatypey}; | ^~~~~~~ peSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * nccl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll pay:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | co:126:n45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ stexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{countl oad{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ | ^~~~~~~ constexpr nvtxPa yncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ loadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccze(datatype), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t Aldatlatype}T; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cco:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntryIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h_t BrAllvS:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t chema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), re c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ vcounts[comm->rank] * ncclTypeSize(datatype/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc), d:212:a38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] t 212 | aconstetxpr nvtxyPayloapdSchemeaEntr}; | ^~~~~~~ y_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: Swarning: endRecvSchema[] = { | ^~~~~~~~~~~~~~ unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{soeadcasntSchdema[c] =o { u | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccn:267:t23: warning: unused variable 'payload' [-Wunused-variable]s 267[ | NvctxPaoramsmBroadmcast- pay>loard{couantn * nkcclT]ypeS ize(*data typen), crootc, dlatatType}y; | p ^~~~~~~ eSize(datatype), recv/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cccount:s[co267mm->:rank/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ 23] * :nccl Typewarning: Sizeunused variable 'payload' [-Wunused-variable](dat atyp e), data267type} | ; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc NvtxParamsBroadc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ ize(datatype), root, datatype}; | ^~~~~~~ :301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemIn file included from a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ Entry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSche/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[]ma[] = { = | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc :307:{22: warning: unused variable 'payload' [-Wunused-variable] 307 | Nv| txPa ^~~~~~~~~~~~rams Gat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccher payl:oad{351sen:dco20unt :* nc cl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccwarning: unused variable 'payload' [-Wunused-variable] 351 | NvTytpeSixze(dPatatayramsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | p ^~~~~~~e), root, d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ atatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { , | data ^~~~~~~~~~~~type }; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc| ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxPaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ ramsRedu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccce p:a378:38:y warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] l 378 | o coanstedxpr{ nvtcxPayoloaduSchenmaEnttry_ t Red*uceS catnterScchemac[] =l { T| ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccy:385p:27: warning: eunused variable 'payload' [-Wunused-variable] Size(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ :343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEnt 385 | r NvytxPa_ramtsRed uceSRcatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cceduceSchema[] = :378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | c{ | o ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:n351:20: warning: unused variable 'payload' [-Wunused-variable] 351s | NtvtxPearamxsRedupce praylo ad{cnount v* nctclTypexSize(Pdaatatyype),l rooot, oap, ddatatySpe};c | ^~~~~~~h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccemaEntry_t ReduceScatterSchema[/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ ] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ :378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatyp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cce}; | ^~~~~~~ :412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nv/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cctxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ :486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ :23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:161:38: warning: unused variable 'AllToAllSchema' [-Wunused-variable] 161 | constexpr nvtxPayloadSchemaEntry_t AllToAllSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] 166 | NvtxParamsAllToAll payload{count * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ :212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:261:38: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count */builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ ncclTypeSize(datatype),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] peer, d 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ atatype/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc}; :301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] | 301 | c ^~~~~~~onstex pr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchem/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ a[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ datatype}/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ ; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:93:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 93 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:98:23: warning: unused variable 'payload' [-Wunused-variable] 98 | NvtxParamsAllGather payload{sendcount * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:126:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 126 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:132:23: warning: unused variable 'payload' [-Wunused-variable] 132 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc236 | static ncclResult_t ncclTopoDevToRank(struct ncclT:161:38: warning: ounused variable 'AllToAllSchema' [-Wunused-variable] 161 | p consteoxpr System* system, int dev, int* nvtxPaylroadScheamaEntryn_t AllTokAllSche)m { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21a[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:166:22: warning: unused variable 'payload' [-Wunused-variable] : warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_ 166 | t NvtxPa ramsAlliToAll pdayload{c,ount * n cclTypeiSize(dantatype)t* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14:, da tatywarning: pe}; unused function 'ncclTopoXGMISpeed' [-Wunused-function] | ^~~~~~~ 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSche/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:212:38: warning: unused variable 'AllToAllvSchema' [-Wunused-variable] 212 | constexpr nvtxPayloadSchemaEntry_t AllToAllvSchema[] = { | maEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:219:23: warning: unused variable 'payload' [-Wunused-variable] 219 | NvtxParamsAllToAllv payload{sendcounts[comm->rank] * ncclTypeSize(datatype), recvcounts[comm->rank] * ncclTypeSize(datatype), datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:261:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 261 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:267:23: warning: unused variable 'payload' [-Wunused-variable] 267 | NvtxParamsBroadcastIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ payload{count * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.ccIn file included from :301:40: warning: unused variable 'GatherSchema' [-Wunused-variable] 301 | constexpr nvtxPayloadSchemaEntry_t GatherSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:307:22: warning: unused variable 'payload' [-Wunused-variable] 307 | NvtxParamsGather payload{sendcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:343:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 343 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:351:20: warning: unused variable 'payload' [-Wunused-variable] 351 | NvtxParamsReduce payload{count * ncclwarning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, intIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ * index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, iTypeSize(datatype), root, op, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:378:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] nt* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 378 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:385:27: warning: unused variable 'payload' [-Wunused-variable] 385 | NvtxParamsReduceScatter payload{recvcoun/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:t271:14 : warning: *unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | sntatcic cflolat TnccylToppoNeVLiSnkBwi(inzt ceuda(ComdpCaap) t{ a| ^~~~~~~~~~~~~~~~t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.hy:282p:13:e warning: unused function 'isPow2' [-Wunused-function]) 282, | st atioIn file included from p, datatype}; | ^~~~~~~ c bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:412:40: warning: unused variable 'ScatterSchema' [-Wunused-variable] 412 | constexpr nvtxPayloadSchemaEntry_t ScatterSchema[] = { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:418:23: warning: unused variable 'payload' [-Wunused-variable] 418 | NvtxParamsScatter payload{recvcount * ncclTypeSize(datatype), root, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:461:22: warning: unused variable 'payload' [-Wunused-variable] 461 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:486:22: warning: unused variable 'payload' [-Wunused-variable] 486 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer, datatype}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long logIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2i(long n) { | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclT/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ opoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/collectives.cc:448:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 448 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 1 warning generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 31 warnings generated when compiling for gfx1101. 31 warnings generated when compiling for gfx90a. 31 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 31 warnings generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/debug.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 31 warnings generated when compiling for gfx942. 31 warnings generated when compiling for gfx1201. 31 warnings generated when compiling for gfx906. 31 warnings generated when compiling for gfx1102. 31 warnings generated when compiling for gfx1030. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1201. 31 warnings generated when compiling for host. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 31 warnings generated when compiling for gfx1200. 1 warning generated when compiling for gfx1200. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx942. [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uintIn file included from 3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ a; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/group.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/group.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t inf gdr_omh_t ;mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h :187:9:| warning: unused variable 'gdrMap' [-Wunused-variable] ^~~~187 | v /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdroi_d *gmdrMaph; | ^~~~~~_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.ht mh:219;:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr| _mem ^~_desc_ t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: *md =warning: (gunused variable 'gdrMap' [-Wunused-variable] 187 | void *dr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ :72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:72:5: warning: unused label 'ignore0' [-Wunused-label] 72 | ignore0:; | ^~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct nccIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | staticlComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNet ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess;ReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* c | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCH } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ ECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSucceomm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCss; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function]CLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrI 25 | static ncclResult_t collNetDeregIn file included from Mr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->nccm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:t nc21clCom:m* co mm, vwarning: oid* unused function 'collNetDevices' [-Wunused-function]collC omm, void* send17Dnit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ ata, | void* recvsData,t int acount,t nccliDatac ncclResult_t collNetDTypee_t davtaTypie, nccclRedeOp_t s(struredOp, cvto ncclComm* comm, int* id* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | statndev) { NCCLCHECK(colCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->nmm->ncclCollNet->deviccclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21:es(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProic npcclReesrties(struct ncclComm* culot_t cmollNemtIflu,sh(str uct nciclComnm* cotmm, vo id* cdollCoemm, vvoid* d,ata, int snize, vcoid* cmhandlle, vNoid**e requtest) P{ NCCrLCHECoK(cperties_t* promom->ncclCollNet->iflush(collComm, data, size, mhandle, request))In file included from ;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] ps) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | stati creturn ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->liste 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] n(dev, handle, listenComm)); return ncclSucces warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ s; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* codata, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetClosellComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResuIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) lt_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uisize_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclColListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ lNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); returnt64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, in ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | nt dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:16: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:224:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 224 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:103:22: warning: unused function 'ncclFuncSendCount' [-Wunused-function] 103 | static inline size_t ncclFuncSendCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:106:22: warning: unused function 'ncclFuncRecvCount' [-Wunused-function] 106 | static inline size_t ncclFuncRecvCount(ncclFunc_t func, int nRanks, size_t count) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:274:21: warning: unused function 'cleanupIpc' [-Wunused-function] 274 | static ncclResult_t cleanupIpc(struct ncclComm* comm, struct ncclCommCallback* cb) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/enqueue.cc:1069:12: warning: unused function 'calcP2pChannelCount' [-Wunused-function] 1069 | static int calcP2pChannelCount(size_t totalSize, int minChannels, int maxChannels, size_t minSize, size_t maxSize) { | ^~~~~~~~~~~~~~~~~~~ 35 warnings generated when compiling for gfx1100. 35 warnings generated when compiling for gfx90a. 35 warnings generated when compiling for gfx1030. 35 warnings generated when compiling for gfx906. 35 warnings generated when compiling for gfx1102. 35 warnings generated when compiling for gfx1101. 35 warnings generated when compiling for gfx1200. 35 warnings generated when compiling for gfx942. 35 warnings generated when compiling for gfx1201. 35 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:In file included from In file included from 77:18: warning: unused variable 'y' [-Wunused-variable] 77 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 35 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:219:19:: 14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14warning: unused variable 'md' [-Wunused-variable] 219 | gd: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | r_me ^~~~m_de sc_t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h *md = (:gdr185_mem:_des12c_t*:)gdr Hawarning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mhndle_; | t ^~ mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nran/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ ks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ _t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXm/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRanklNode* * node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* nodepayload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ :2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ | NvtxP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ aramsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] devProp; | 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitA/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ llSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ :1857:11: warning: unused variable 'stackSize' [-Wunused-variable] 1857 | int64_t stackSize; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:1858:19: warning: unused variable 'devProp' [-Wunused-variable] 1858 | hipDeviceProp_t devProp; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2264:26: warning: unused variable 'payload' [-Wunused-variable] 2264 | NvtxParamsCommI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNonitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ de) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2278:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2278 | constexpr nvtxPa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ yloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20ype, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function]: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ eturn comm->ncclCollNet->nameIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* li; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclRes u 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struclt_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNestenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(commtProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncc->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNodlComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->nccltC kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ ollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listee* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankScnComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, hema[] = { | ^~~~~~~~~~~~~~~~~~ redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2563:26: warning: unused variable 'payload' [-Wunused-variable] 2563 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ lComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2598:26: warning: unused variable 'payload' [-Wunused-variable] 2598 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | statiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static c ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21:long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncc warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefalComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCult(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct nLCHECK(comm->ncclCollNet->getProperties(dev, props)); returcclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t valuen ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); ) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | statiComm, void* mhandle) { NCCLCHECK(comm->ncc ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank,clCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h: void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, liste26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static nComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECKncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle,(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSu void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccesccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const chs; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int*ar* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrN size) { NCCLCHECK(comm->ncame, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml,clCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHEC const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertTK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRanoStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ kToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* systIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, inem, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~57 warnings generated when compiling for gfx1102. t type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:39: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:40: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:86:21: warning: unused function 'commReclaim' [-Wunused-function] 86 | static ncclResult_t commReclaim(ncclComm_t comm); | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init.cc:2249:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 2249 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 57 warnings generated when compiling for gfx90a. 57 warnings generated when compiling for gfx942. 57 warnings generated when compiling for gfx1100. 57 warnings generated when compiling for gfx1030. 57 warnings generated when compiling for gfx1200. 57 warnings generated when compiling for gfx908. 57 warnings generated when compiling for gfx1101. 57 warnings generated when compiling for gfx906. 57 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/nvtx.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = {/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc:4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/init_nvtx.cc2 warnings generated when compiling for gfx942. :4:42: warning: unused variable 'NvtxEnumRedSchema' [-Wunused-const-variable] 4 | static constexpr const nvtxPayloadEnum_t NvtxEnumRedSchema[] = { | ^~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx906. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc 57 warnings generated when compiling for host. [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/msccl.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/net.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | | ^ uint32_t y, head, mant2 warnings generated when compiling for gfx1200. issa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx1030. warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc/usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc :54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:54:38: warning: unused variable 'MscclSchema' [-Wunused-variable] 54 | constexpr nvtxPayloadSchemaEntry_t MscclSchema[] = { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** :54:38: warning: nunused variable 'MscclSchema' [-Wunused-variable] 54 | coonstexprd nvtxPaeyloadSch)emaEntry_ t MscclSchema[] = {{ | | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc :61:19: warning: unused variable 'payload' [-Wunused-variable] 61 | NvtxParamsMsccl payload{count * ncclTypeSize(dataType), op, dataType}; | ^~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclIn file included from Result_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/enqueue.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/msccl.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1201. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1200. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx942. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/register.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/register.cc.o -MF CMakeFiles/rccl.dir/hipify/src/register.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/register.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable]In file included from 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc ^ :289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ :289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:289:7: warning: variable 'sublist_len' set but not used [-Wunused-but-set-variable] 289 | int sublist_len = 0; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:In file included from 14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: twarning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ atic long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/proxy.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx1200. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1201. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1100. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/register.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx908. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc 3 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.cu.cpp:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx908. 1 warning generated when compiling for host. 2 warnings generated when compiling for host. [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/onerank.cu.cpp:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1200. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ :124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:124:12: warning: unused variable 'y' [-Wunused-variable] 124 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:131:7: warning: unused variable 'localRanks' [-Wunused-variable] 131 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:265:21: warning: unused function 'getIndexes' [-Wunused-function] 265 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/connect.cc:439:21: warning: unused function 'connectNvls' [-Wunused-function] 439 | static ncclResult_t connectNvls(struct ncclComm* comm, int* nvlsHeads, int nHeads) { | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1201. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1200. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for host. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ :462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rings.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ :275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ :462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.ccIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { :462:24: warning: unused variable 'gpu' [-Wunused-variable] e) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21:462 warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | s | tatic ncclR esult_t xml GetAttrLonsg(structt ncclXmlNodre* node, cuonst char*c attrName,t int64_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc ncct* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int clTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ udaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncc:275:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 275 | int intermediateIndex = -1; | ^ mlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ lXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:462/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:462:24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ :24: warning: unused variable 'gpu' [-Wunused-variable] 462 | struct ncclTopoNode* gpu = system->nodes[GPU].nodes+g; | ^~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | stati 140 | static ncclRcxmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, consesult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* nod ncclReesult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | , const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:static ncclResult_t xmlSettA char* atttrName, tconst char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct rIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* vancclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, 377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struicntt *k vnDeitcDte*v )d i{c t )| ^~~~~~~~~~~~~~~~~~{ | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | st/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.ha:t390i:c21 :f lwarning: ounused function 'kvConvertToStr' [-Wunused-function]a t ncclTop o390N | VsLtiantkiBcw (nicnctl RceusdualCto_tm pkCvaCpo)n v{e r t| T ^~~~~~~~~~~~~~~~o Str/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h(:i282n:t13 :v awarning: lunused function 'isPow2' [-Wunused-function]u e, c o282n | sstt acthiacr *b*o oslt ri, struct kvDictsP*o wd2ic(ti)n t{ v a| l ^~~~~~~~~~~~~~) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:{ | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* att9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(strulue, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ rName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struc1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. t ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1100. ct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat1 warning generated when compiling for gfx1200. (struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(st 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ ruct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/paths.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static 1 warning generated when compiling for gfx1201. ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx942. [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc 31 warnings generated when compiling for gfx1102. 31 warnings generated when compiling for gfx1201. 31 warnings generated when compiling for gfx90a. 31 warnings generated when compiling for gfx1030. 31 warnings generated when compiling for gfx906. 31 warnings generated when compiling for gfx942. 31 warnings generated when compiling for gfx1100. 31 warnings generated when compiling for gfx1101. 31 warnings generated when compiling for gfx908. 31 warnings generated when compiling for gfx1200. 31 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] | int nChannels = 0/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ ; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc 1351 | int x=0, y=0; | ^ :1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1341:7: warning: unused variable 'nChannels' [-Wunused-variable] 1341 | int nChannels = 0; | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1351:12: warning: unused variable 'y' [-Wunused-variable] 1351 | int x=0, y=0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ :2036:7: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system-/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc>:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1858:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1858 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:1930:9: warning: unused variable 't' [-Wunused-variable] 1930 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2036:7: warning: unused variable 'ncpus' [-Wunused-variable] 2036 | int ncpus = system->nodes[CPU].count; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2130:9: warning: unused variable 't' [-Wunused-variable] 2130 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2240:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 2240 | int gcnt = 0; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:2316:9: warning: unused variable 't' [-Wunused-variable] 2316 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const clFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] har* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | staticIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t vaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const chIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | sstatict ncclRaesult_tt xmlFiindNextTcag(str uct nlcclXmlo* xml,n constg log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, lcounes)t {c h a| r ^~~~~~~~~~~~~~* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h::21:267 :warning: 21unused function 'xmlGetAttrIntDefault' [-Wunused-function]: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 118 | static n267c | csltRaetsiucl tn_ctc lxRmelsGuelttA_ttt rxImnltUDnesfeatuAlttt(rs(tsrturcutc tn cncclcXlmXlmNloNdoed*e *n ondoede, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ ar* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct n, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* valucclXmlNe) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.hode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ :377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* at44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTatrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ g(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:23: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:26: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/rome_models.cc:27: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xml21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ FindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 38 warnings generated when compiling for gfx1201. 38 warnings generated when compiling for gfx1101. 38 warnings generated when compiling for gfx90a. 38 warnings generated when compiling for gfx1100. 38 warnings generated when compiling for gfx906. 38 warnings generated when compiling for gfx1102. 38 warnings generated when compiling for gfx1030. 38 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 38 warnings generated when compiling for gfx1200. 38 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const chaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* r* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRxml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ emoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | stat125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ ic ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/search.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx942. 38 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc 18 warnings generated when compiling for host. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/trees.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, co 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | snst int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ tatic ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) {struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] clTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, in_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ t type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, costatic ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const flnst char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:oat value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | sta241:tic ncclResult_t xmlS21etAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] : warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | s 267 | static ncclResult_t xmlUnsetAttr(struct nccltatiXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] c ncc 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function]l Result _334t | sxtmaltSeitcA ntctcrlFlRoeasutl(ts_ttru cxtm lnRcecmloXvmelNNooddee(*s tnruct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ ode, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev,:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static nc void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm))c; return nccllResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connecSuccet(handsls; }es | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h, nra:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | stnks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:atic ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(c21:21:o warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | stmatic mncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_-t >redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int counncclCollNet->regMr(collComm, t, ncclDataType_t datadata, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28Type, ncclRedOp_t redOp, void* sendMhandle, :21:vo id* recvMhandwarning: le, unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush( void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static nstruct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ cclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 30 warnings generated when compiling for gfx1101. 30 warnings generated when compiling for gfx1201. 30 warnings generated when compiling for gfx1030. 30 warnings generated when compiling for gfx90a. 30 warnings generated when compiling for gfx942. 30 warnings generated when compiling for gfx1102. 30 warnings generated when compiling for gfx908. 3030 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx906. 30 warnings generated when compiling for gfx1200. [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc 30 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.ccn:dex(str12uct ncclTop: oSystem*/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h system, int type:, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ 214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:337:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 337 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 338 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 339 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 340 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:343:9: warning: unused variable 'ppn' [-Wunused-variable] 343 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/tuning.cc:626:14: warning: unused variable 'treeCorrectionFactor' [-Wunused-variable] 626 | static float treeCorrectionFactor[NCCL_NUM_PROTOCOLS][23] = { | ^~~~~~~~~~~~~~~~~~~~ 15 warnings generated when compiling for gfx1200. 15 warnings generated when compiling for gfx942. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 1515 warnings generated when compiling for gfx1030. warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1201. 15 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n)xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | st { atic ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, i| ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ nt defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloatIn file included from loc(struct ncclXml** xml, i(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305nt maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ | stat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13:ic ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] warning: unused function 'log2i' [-Wunused-function] 44 | static 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static nccllongResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.cc:17: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 1010 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx942. 10 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for host. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/archinfo.cc /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | : 103:13: warning: unused variable 'ret_domain' [-Wunused-variable] i103 | f int ret_ domain = (read_noder_propertiees(node_idt, "domain_", &domain, gpropertieps); | ^~~~~~~~~~ u_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) &/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc& | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ :105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ :103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:105:33: warning: bitwise negation of a boolean expression always evaluates to 'true'; did you mean logical negation? [-Wbool-operation] 105 | if (ret_gpu_id == 0 && ~(ret_unique_id != 0 || ret_loc_id != 0 || ret_unique_id != 0 || ret_vendor != 0) && | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ! /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:103:13: warning: unused variable 'ret_domain' [-Wunused-variable] 103 | int ret_domain = read_node_properties(node_id, "domain", &domain, properties); | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops;unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:233:22: warning: unused variable 'hops' [-Wunused-variable] 233 | uint64_t hops; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:70:14: warning: unused variable 'count' [-Wunused-variable] 70 | uint32_t count = 0; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ :52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:52:20: warning: unused variable 'kPathDRMRoot' [-Wunused-variable] 52 | static const char *kPathDRMRoot = "/sys/class/drm"; | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/alt_rsmi.cc:559:13: warning: unused function 'fileExists' [-Wunused-function] 559 | static bool fileExists(char const *filename) | ^~~~~~~~~~ 6 warnings generated when compiling for gfx1201. 6 warnings generated when compiling for gfx1100. 6 warnings generated when compiling for gfx1102. 6 warnings generated when compiling for gfx1030. 6 warnings generated when compiling for gfx1200. 6 warnings generated when compiling for gfx908. 6 warnings generated when compiling for gfx1101. 6 warnings generated when compiling for gfx942. 6 warnings generated when compiling for gfx906. 6 warnings generated when compiling for gfx90a. [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc 6 warnings generated when compiling for host. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77t:18: warning: unused variable 'y' [-Wunused-variable] 77y | , u int32h_t y, eheaad, madntis,sa; | ^mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_In file included from t y, he/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: aIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18d: warning: unused variable 'y' [-Wunused-variable] 77 | , ui nt32_t y,m head, maanntissa; | tis ^sa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h::10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:1014: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: : unused function 'log2i' [-Wunused-function] 44 | stIn file included from atic lo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.hng log2i(lo:ng n) { 38| ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/argcheck.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/argcheck.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1201. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/api_trace.cc:3: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for host. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvsymbols.cc:67: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1100. [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ibvwrap.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/ibvwrap.h:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx90a. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | statIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ ic long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/ipcsocket.cc:8: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1201. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/nvmlwrap_stub.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, man/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.hi:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.hs:77:18: warning: unused variable 'y' [-Wunused-variable] s77 | uiant32_t y, ;head, mant issa; | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/npkit/npkit.h:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:In file included from 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long lIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ og2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/npkit.cc:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1201. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/param.cc 2 warnings generated when compiling for host. [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from 77/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h: ui13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:t32_t y, head, mant18: iwarning: unused variable 'y' [-Wunused-variable] 77 | s uisnat;3 2 _| ^ t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/profiler.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/profiler.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/proxy.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/info.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocmwrap.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/rocm_smi_wrap.cc:24: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1200. 11 warning generated when compiling for gfx1102 warning. generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. 11 warning generated when compiling for gfx1201. warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1100. [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14In file included from : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc :7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/roctx.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/roctx.h:18: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. 11 warning generated when compiling for gfx1201. warning generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/signals.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/shmutils.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx942. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/strongstream.cc /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:602/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ :602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc | :602:8: warning: unused variable 'line' [-Wunused-variable] 602 | char line[SOCKET_NAME_MAXLEN+1]; | ^~~~ ^~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/socket.cc:9: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for host. [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/tuner.cc:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/tuner.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx942. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | sIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ tatic long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/utils.cc:7: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 11 warning generated when compiling for gfx908. warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1100. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/graph.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = co/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:517:10: warning: unused variable 'nBytes' [-Wunused-variable] 517 | size_t nBytes = count * ncclTypeSize(dataType); | ^~~~~~ unt * ncclTypeSize(dataType); | ^~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct nccclt ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ TopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long lo:225g:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 2252 | static ncclReisult_t ncc(lTopoRankTolIndexong n) {(s truct nc clTopoSyst| em* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t nintc dev, inct* rankl) { | ^~~~~~~~~~~~~~~~~T /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:o21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] p248 | statioc ncclReRsult_t ancclTopnoIdTkToIndoeNetDev(xs(struct ncclTopotruct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const charSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclRes*u gcn)l { | t ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:_14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function]t 271 | s tatinc flcoat nccclTopolNVLinTkBw(oint cpudaCoompCapI) { | d ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:T282:13: warning: ounused function 'isPow2' [-Wunused-function] Ne282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val,t Dev(istrunct tnccl TopopSystoem* wsyst2em), in t64_{t id , int * n| etD ^~~~~~~~~~ev) { In file included from | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc 261 | s:tat22ic : floa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.ht ncc:lT75:opoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkB21w: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function]( 75i | stantic tncclR esulct_t umsccdlXmlGaetAtCtrIont(smtrucpt msCcclXamlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.hp) :{ 82| ^~~~~~~~~~~~~~~~ :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:21282:13:: warning: unused function 'isPow2' [-Wunused-function] warning: 282 | unused function 'mscclXmlGetAttrInt64' [-Wunused-function]sta tic boo82 | static ncclResult_tl ismPow2(int svalc) {c | l ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.hX:285m:12:l GetAttrInt64(struct mscclXmlNode* nodewarning: ,unused function 'mirrorBits' [-Wunused-function] 285 | sctatoic int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static nncst clResult_t cmhars* acttrcNamleX, imnt6l4_tG* vealute) {A | t ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.ht:89r:21:I warning: unused function 'mscclXmlFindTag' [-Wunused-function] n 89t | st(atisc ntcruct mscclXmlclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName,No de*s notde, rconust cchatr* attmrNasmec, icnt*l vaXluem) {l N| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.ho:82d:21:e warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function]** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: 82 | warning: staunused variable 'mscclAlgoFilePathEnv' [-Wunused-variable]tic nc clResu33lt_ | t mssctclaXmltGetic constAttrInt64(struct mscclXmlNode* node, const char* attrName, i chnar*t ms6ccl4Algo_FilteP*athE nv v= "aMSCClL_AuLGOe_F)ILE_ PAT{H | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h";: | 89 ^~~~~~~~~~~~~~~~~~~~ :21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, i13n: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ t* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | statiIn file included from c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:17: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:19: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:22: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_lifecycle.cc:33:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 33 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1200. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 15 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:16: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ :724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:712:16: warning: unused variable 'ret' [-Wunused-variable] 712 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:724:16: warning: unused variable 'ret' [-Wunused-variable] 724 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_parser.cc:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 4 warnings generated when compiling for gfx1200. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx1201. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx942. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for host. [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc 15 warnings generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGet/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ ThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:128:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 128 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_setup.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/channel.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/misc/msccl/msccl_status.cc:6: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_status.h:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/msccl/msccl_struct.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1201. 3 warnings generated when compiling for gfx942. 3 warnings generated when compiling for gfx908. 11 warning generated when compiling for gfx90a. warning generated when compiling for gfx906. 3 warnings generated when compiling for host. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1201. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for host. [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.hIn file included from :14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h ui:nt32_t y14, head, ma: ntissa; In file included from | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long nIn file included from ) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static lonIn file included from g log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/generic.cc:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 22 warnings generated when compiling for gfx90a. warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for gfx1030. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] :183 | gd r_info_warning: t infunused variable 'mh' [-Wunused-variable]o; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185 :12: warning: unused variable 'mh' [-Wunused-variable]185 185 | | gdr_m h_t gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ _desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; 2 warnings generated when compiling for gfx908. | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclRIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ esult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t isinze)f { oNC;CLC HE C| K( ^~~~com m->ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.hlColl:Net185->t:est12(re:que st,warning: dounused variable 'mh' [-Wunused-variable]ne, si ze));185 r | et u rgn ndccr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219lSu:cces19s; :} | ^~~~~~~~~~~warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.hunused variable 'md' [-Wunused-variable]:30: 21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | sta219tic | nc clResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHE gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHaCnK(dcolmm->enc;cl Co l| lN ^~et ->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static leotPrnogpe rlties_ot* gpro2ps)i ({ NlCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int devong n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } , void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclCo| ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static nmm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclcclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProComm* comm, void* collComm) { NCCLpCerties(dHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNev, props)); return ncclSuccess; }et != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* da:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLta, sizCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclColle_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct nccMr(collComm, data, size, type, mhandle)); return ncclSulComm* comm, void* request, int* doccess;ne, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, siz)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ e_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); retuIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetrn ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) Devices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) {{ NCCL CHNCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offECK(comm->ncclCollNet->closeColl(collComm)); return ncset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] clSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* c 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLomm, CHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.hvoid* listenComm) { NCCLCHECK(comm->ncclCollNet->closeLi:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ sten(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ CLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeC22oll(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static n warnings generated when compiling for gfx906. cclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t nIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NcclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGetCC(LCHEsCK(cotmm->nrccluCollNet->devicesc(ndevt)); re turnn ncclScuccescs; } l| ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.hC:18o:l21:lN warning: etSharedRes* collNunused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { Net, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ CCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm,andles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, voi warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncd* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:clComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collN14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ etRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:9: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:10: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, size_t size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:203:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 203 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/coll_net.cc:406:21: warning: unused function 'sharedBuffersGet' [-Wunused-function] 406 | static ncclResult_t sharedBuffersGet(struct ncclCollNetSharedRes* collNet, int type, int slot, int channel, int* offset) { | ^~~~~~~~~~~~~~~~ 22 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx1101. 22 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx1200. 22 warnings generated when compiling for gfx1100. 22 warnings generated when compiling for gfx1102. 22 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx1030. 22 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.hgdrH:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ andle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:183:14: warning: unused variable 'info' [-Wunused-variable] 183 | gdr_info_t info; | ^~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:185:12: warning: unused variable 'mh' [-Wunused-variable] 185 | gdr_mh_t mh; | ^~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:187:9: warning: unused variable 'gdrMap' [-Wunused-variable] 187 | void *gdrMap; | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:219:19: warning: unused variable 'md' [-Wunused-variable] 219 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResulIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(ct_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | stationst char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ c ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(intIn file included from cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struMap* map) { | ^~~~~~~~~~ ct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | In file included from ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10p: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13(: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:s15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:r44:13: warning: unused function 'log2i' [-Wunused-function]u 44 | sctatitc l ong lcog2i(loong n) n{ | ^~~~~ nIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cce:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163c:14: warning: unused function 'ncclGdrInit' [-Wunused-function] t 163 | stMatic agdpr_t nc*clGd rImap) { | ^~~~~~~~~~nit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoId ToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* syst/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/gdrwrap.h:163:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 163 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:21: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncem, inct64_t idl, int* RnetDev) e{ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.hs:261:14u:l twarning: _t ncclTopoIdToIunused function 'ncclTopoXGMISpeed' [-Wunused-function] n261 | stadtice float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static flx(ostruct ancclTotpoSyst em* systnem, iccntl tyTpe, into64_t id,p into* index) N{ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.hV:225:21:Lin warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | kstatic BncclReswu(lint cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:t_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int13: warning: unused function 'isPow2' [-Wunused-function] 282 | srtatic abooln isPowk2(int, val) { in t| * index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236: ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] warning: unused function 'netDumpMap' [-Wunused-function]236 | stat ic ncc lResu285lt_ | t ncsclTtopoDeavtToRiank(sct ruct nnccclTocpoSyslteRm*e sysstem,ul int tdev, _itnt* rnaetDnk)u m{p M | ap(struct connectMap* map) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* s ^~~~~~~~~~y stem, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_tmp.cc:285:21: warning: unused function 'netDumpMap' [-Wunused-function] 285 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 16 warnings generated when compiling for gfx1101. 16 warnings generated when compiling for gfx1102. 16 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 16 warnings generated when compiling for gfx906. 16 warnings generated when compiling for gfx1030. 16 warnings generated when compiling for gfx908. 16 warnings generated when compiling for gfx90a. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/net.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ rInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | staIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) tic ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static{ | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToIn ncclResult_t xmlGetSubKvInt(stt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ ruct ncclXmlNode* node, const char* subName, stIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:ru14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNodeName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ (struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemovIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFieNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResultndTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ _t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* novConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | stade, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ tic ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncc{ | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_ib.cc:30: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:76:21: warning: unused function 'xmlAlloc' [-Wunused-function] 76 | static ncclResult_t xmlAlloc(struct ncclXml** xml, int maxNodes) { | ^~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:111:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 111 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:118:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 118 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:125:21: warning: unused function 'xmlGetAttrLong' [-Wunused-function] 125 | static ncclResult_t xmlGetAttrLong(struct ncclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:133:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 133 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:140:21: warning: unused function 'xmlFindTag' [-Wunused-function] 140 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:152:21: warning: unused function 'xmlFindNextTag' [-Wunused-function] 152 | static ncclResult_t xmlFindNextTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode* prev, struct ncclXmlNode** node) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:164:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 164 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:180:21: warning: unused function 'xmlFindNode' [-Wunused-function] 180 | static ncclResult_t xmlFindNode(struct ncclXmlNode* parentNode, struct ncclXmlNode* searchNode, struct ncclXmlNode** node) { | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:203:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 203 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:216:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 216 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:228:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 228 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:241:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 241 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:254:21: warning: unused function 'xmlSetAttrLong' [-Wunused-function] 254 | static ncclResult_t xmlSetAttrLong(struct ncclXmlNode* node, const char* attrName, const int64_t value) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:267:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 267 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:279:21: warning: unused function 'xmlGetSub' [-Wunused-function] 279 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:305:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 305 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:312:21: warning: unused function 'xmlAddNode' [-Wunused-function] 312 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ lXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ arent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:334:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 334 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:347:21: warning: 'static' function 'xmlAddTree' declared in header file should be declared 'static inline' [-Wunneeded-internal-declaration] 347 | static ncclResult_t xmlAddTree(struct ncclXml* dst, struct ncclXmlNode* parent, struct ncclXmlNode* srcNode) { | ^~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:377:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 377 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/xml.h:390:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 390 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 24 warnings generated when compiling for gfx908. 24 warnings generated when compiling for gfx906. 24 warnings generated when compiling for gfx90a. 24 warnings generated when compiling for gfx1201. 24 warnings generated when compiling for gfx1200. 24 warnings generated when compiling for gfx1101. 24 warnings generated when compiling for gfx1102. 24 warnings generated when compiling for gfx1100. 24 warnings generated when compiling for gfx942. 24 warnings generated when compiling for gfx1030. 16 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc 24 warnings generated when compiling for host. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, long log2head, mantissa; | ^ i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/net_socket.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/nvls.cc:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx90a. [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1100. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTop/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:o8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.hI:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13:T warning: unused function 'log2i' [-Wunused-function] 44 | sotatic longI log2i(lonng n) { | ^~~~~ dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214e:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | xstatic nccl(Result_t sncclTopoItdToIndex(strruct ncclToupoSystem* csystem, intt type, in ncclTopoSystem* systet64_t idm, int* index) { | ^~~~~~~~~~~~~~~~~, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:(225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] s225 | statitc ncclResulrt_t ncclToupoRankToIncdex(structt ncclTop oSystem* system, innt rank, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ccilTopoSystem* system, int rank,nt* index) { i | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21:n warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | statIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ * index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21tic nccl:Result_t n cclTopoDeIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResuvToRlank(structt ncclTopoS_yst nccltem* system, inTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) t d{ev, in t* r ank) {| | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248 :21: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.hunused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResu:lt_t 248ncclTo:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(structpoIdTo NetDevn(strucct ncclTcopoSysltem* sTystem,o int64_pt id, oint* neStDev)ystem* system, int64_t id, int* n {e | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.ht:261:14:D warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] e 261 | svtatic f)loat ncc{ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function]lTopoXGMISpeed(const char* 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int c gcun) { d | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.ha:271:14: Cwarning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] o271 | stamtic pfloaCt ncaclToppoNV)Link Bw(i{nt c udaC omIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRa| pCap ^~~~~~~~~~~~~~~~) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: :warning: unused function 'isPow2' [-Wunused-function] 282282 | st:atic13 boo:l is Pow2warning: (intunused function 'isPow2' [-Wunused-function] val ) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static in:t285:12: warning: unused function 'mirrorBits' [-Wunused-function] m nkToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ 285i | startic int rmirroorBits(int val, int pow2) { | ^~~~~~~~~~ rBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ clTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclToIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevpoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ orBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:8: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/p2p.cc:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:214:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 214 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:225:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 225 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:236:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 236 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:248:21: warning: unused function 'ncclTopoIdToNetDev' [-Wunused-function] 248 | static ncclResult_t ncclTopoIdToNetDev(struct ncclTopoSystem* system, int64_t id, int* netDev) { | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:261:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 261 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:271:14: warning: unused function 'ncclTopoNVLinkBw' [-Wunused-function] 271 | static float ncclTopoNVLinkBw(int cudaCompCap) { | ^~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:282:13: warning: unused function 'isPow2' [-Wunused-function] 282 | static bool isPow2(int val) { | ^~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/graph/topo.h:285:12: warning: unused function 'mirrorBits' [-Wunused-function] 285 | static int mirrorBits(int val, int pow2) { | ^~~~~~~~~~ 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx1201. 10 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/transport/shm.cc:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/comm.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/p2p.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/core.h:38: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/alloc.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/utils.h:44:13: warning: unused function 'log2i' [-Wunused-function] 44 | static long log2i(long n) { | ^~~~~ 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1200. 10 warnings generated when compiling for gfx1200. 2 warnings generated when compiling for gfx942. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1201. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for host. 2 warnings generated when compiling for host. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ a; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cppy:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from _/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ groupIn file included from (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barriIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | In file included from ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 29 | const In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpparrie:r_by_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: grIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from o/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173 : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] b 75up(); | | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 29:a15: note: expanded from macro 'barrier_by_group' 29r | con rst intIn file included from ier_b barri w = thread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrierint w = threadIdx.x/WARP_SIZE; \ | ^ _by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hy_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :29:15: note: expanded from macro 'barrier_by_group' 29 | er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_belId - work->channelLo; | ^~~ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_In file included from t data1, fla/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from 1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h,:75In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :7: warning: unused variable 'w' [-Wunused-variable] 75d | a In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrtier_bay_grou2p(); , | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :29:15:f /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: lnote: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: aexpanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174 g: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: 275:7: ;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable]29 | 75 | | ba ^~~~~rri e r_b y/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h_gcroup(o:); n145 In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ :s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:28t15: note: : expanded from macro 'barrier_by_group' 29 i | warning: n conunused variable 'data2' [-Wunused-variable]tst i nt ww = t 145hrea= | dIdx .x/Wt ARP_hS IZEr; \ eu| ^ iIn file included from nt32_t data1, fladaIdx.xg/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:l2: In file included from a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: gIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 174: ;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 :14: | ^~~~~ warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t datIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: In file included from warning: unused variable 'data2' [-Wunused-variable] 145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp: | 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174 : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: 145:14uIn file included from :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: iunused variable 'data1' [-Wunused-variable] n145 | t 3uin2t32__t tdat a1,d flaag1t, a1, dafta2l, falagg2; 1 | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, In file included from flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :dat2a2,: fIn file included from lag2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ;: 11| ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145: :28In file included from : warning: unused variable 'data2' [-Wunused-variable] , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hunused variable 'flag2' [-Wunused-variable]:173 : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 75:145 7 | 145 | : u warning: in unused variable 'w' [-Wunused-variable]t3u In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2_i tn dt75at3 | a 21In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ _t d a bt,a afrl1rag,i1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ e dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ _by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ intIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ w = In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ ata2, flag2;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from 145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, fy_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ IZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80In file included from | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ t int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hag1, data2, flag2; | ^~~~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ RP_SIZE; \ | ^ :19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmemIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:t2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:.channelId - work->channelLo; | ^~~ warning: unused variable 'ptr' [-Wunused-variable] 271 | r(0)+ll128Offset; | ^~~ uint64_t* ptr = recvPtr(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrieIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r_by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ unused variable 'bid' [-Wunused-variable] 20In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const intIn file included from bid = ncclShmem.cIn file included from hannelId -/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27w:15:o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cppwarning: runused variable 'bid' [-Wunused-variable]: k227: - | In file included from > /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hc c:hon11a:s: n218tIn file included from n:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h e15:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:il:174n: t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hL warning: :ounused variable 'bid' [-Wunused-variable]b145; i:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ d 14218: | | =warning: ^~~ unused variable 'data1' [-Wunused-variable] n const int bid = ncclShmem.channelId - work->In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cppt data1, flag1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h,:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] c clShm145eIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ m | .chan nel Id - woruk->ichannnetlLo3; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hdata1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = const intIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ biIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ d = ncclShmem.channelId - work->channelLo; | ^~~ ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ d - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:20:15: warning: unused variable 'bid' [-Wunused-variable] 20 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | conIn file included from o/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hk:271:19:- warning: unused variable 'ptr' [-Wunused-variable] >271 | c huint6a4_t* pntr =n recvePtr(0l)+ll1L28Offoset; ; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~ st int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdIn file included from x.x), gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:o11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:u670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] p670 | tid(tid), nthr(eads(nthreadgs), tidInrBlooup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primit 671 | i stepSivze(stepSizee_ == 0ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL ?s ncclShmem.com, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ X_DEV_AR ITY, 1>, t/*Direict=d(tid), nthreads(nthreads), wid(tid*%/0, PrWoto, 0A> primRs | ^ P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5_: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here S565 | ic, /*D nirectwarp(tid/WARP_SIZE), T reeU| pDow ~~~~~~~~~~~~~~~~~~n p rime s < | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h 1w:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ arpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | , 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h(:432n:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested herec 432c | l iSf (thid T().Orun_(tiLd,In file included from L128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.hubtn, wor:k);58 | : ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp56:7::1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested herenote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here7 | DEFI NE_ ncclD58evF | unc (Al lRe duc e_TPRErE_SIMPLE_MinMa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:imi670ti:ves<15T, :Red Op,warning: Finitializer order does not match the declaration order [-Wreorder-ctor]anS ymm etric<1670>, | 0, Pro to, 0> prtims i | ^d /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h(:171:5t: note: iin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here d171 | ) ,run Rinngunc(Mtid, ntihnMarx,e ahipd_bsfl,oa tw16o, NrCCkL_)AL;GO _ TR| EE ^, N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hCCL_P:RO432TO:_S78IM:PL E,note: 2in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h432: | 611: 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | i Rfun Wo(rktBaitcdh< co, algo, proto, unroll>().run();tn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ R:e670:d15:Op, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllGather_RING_LL128_Sum_i8_2, ncclFunc note: Afield 'nthreads' will be initialized after field 'tidInBlock' l 670l | G a ttid(tid), nthreadhs(entrhr,ea dsF),u tnidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid),Idx.x), group(g nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rcSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62o:up ),note: expanded from macro 'DEFINE_ncclDevFunc'| ^~~~~~~~~~~ 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 1070 | runTreeSplit(tid, nthreads, work); | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp(:2t: ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ id),In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: 11: nthreads(nthreads), tidInBlock(threadIdx.x), group(grIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(oup), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWo, T, RedOp, Algo, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RIn file included from u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_2, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5:s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax,:670:15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]h 670i | p tid_(tidb), fnthrleads(ontahreatds), 1tidI6nBlo,ck(t hreadNIdxC.x), CgrouLp(gro_up),A | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_G 671O | _TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | sIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SI 670 | tid(MPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tidIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: : note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:671 | stepSize(stepSize173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:_670In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_P:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t=id(tid)= 0 , nt?/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h hrn:670:15: ecwarning: initializer order does not match the declaration order [-Wreorder-ctor] clShmem.comm.buffSizes[NCCads(nthreads), tidInBlock(threadIdx.x), grou 670p | tid((tid), nthgreads(nthrreads), otidInBlocuk(thp), readIdx.x), gro | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==ROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ?up(g roup)n, | c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | c tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | l stSepSizhe(stemem.comm.buffSizpSizee_ == s0[NCC ?L nccl_ShmemPROTO_SIMPLE]/NCCL_STEPS/sizeo.comm.bfuffSiz(es[NCTCL_PR)OTO_SI MPLE]:/NCCL _SstepSize_) { | T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~EPS/s izeof (T) :| ste group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hpSiz:254:e_) 90{ : note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 254 group(group | Primitive/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.hs:58<:56: Tnote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here ,58 | PRriedOp, mitives, /*Direct=*/0, Proto,, FanSy mmet0ric<>1>, 0, Pprotor, 0> iprimms s| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h :157:5| : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:157 | runnote: Ringin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested herep(tid,l nthreeads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ,<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCol/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(l(n).ru,n(ti d, swubtno, worrk);k | ^) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:;12:1 : note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12| | DEF ^INE_ nccl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cppDevFunc(A:llGa7ther:_:670R1:15:I: warning: initializer order does not match the declaration order [-Wreorder-ctor]N note: G670 | _in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here S tid I(tiMd),7P nt | LhreDEadsE_(ntFShreaIuds)Nm,E_ncclDevFunc(AllReduce_TREE_SIMP_i8_4, ncclFuncAllGather, FuncSum, int8_t, NC tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NLE_MinMaxC_bf1C6_2,L ncc_lFunScAllTReduEce, PFuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' C L_ALGO611_R | ING, NCC L_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, prot o ,R uunnWroorlklB>a(t)c.hr , S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: 670a | l g o , tpirdo(ttoi,d )u,n rnotlhlr>e(a)d.sr(untnh(r)e;a d\s ) note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ,| ^t i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hdI:nBl670o:c15k:( tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r e ad670I | dx . x t)i,d (gtriodu)p,( gnrtohupr)e,a d s| ( ^~~~~~~~~~~~~~~~~n t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hhr:e670a:d60s:) ,note: field 'group' will be initialized after field 'stepSize't i d670I | n B l toicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrthoruepa(dgsr)o,u pt)i,d I | n ^~~~~~~~~~~~~~~~~B l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hock:(670t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize'I d x670. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~d s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclD/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:evFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(th/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nlDevFtunc(AlhlReducre_TREEe_SIMPLaE_MinMdax_bf1s6_2, n)cclFun,cA tidInBlllReoduce, ck(threadIdx.xFunc)MinMax,, hip_ bfgroup(grouloat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tnthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:58:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 58 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_gather.h:157:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 157 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ tn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here id < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_gather_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(AllGather_RING_SIMPLE_Sum_i8_4, ncclFuncAllGather, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp : 12 tid:(ti1d),: nt hrenote: adsin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here(nt hre ads),12 ti | dInDBloEck(FINE_ncclDevFunc(AllReduce_RIthNreaGdId_x.x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S), IgroMup(PgroLup)E, _| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ M| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ i 671 | n Max_bf16_2, ncclFuncAllReduce, FuncMinM a stxepS,iz e(shteipSipze__ ==b 0 f? nlcclSohmeam.ctomm1.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s6, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'tepS i 611 | RunWorkBatch{ ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupa /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hl:303g:90: onote: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here , 303 | p rPriotmitoive,s<1().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, NCCL:_MA670X_D:EV15_AR:ITY >, note: /field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDso), wtidnInBl , C OL| L_UNRO ^~~~~~~~~~~LL> (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] MPLE_MinMa670x_bf16_2, | ncclFunc AllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCC tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMaL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hx, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid | ( RunWorkBtatcheads(nthe,reads), a tidds(nthreaalgo, protod, InBlocksu(threa)d, tidnrIoll>Idx.xn(),B).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 gr | oup(gro up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s tid(tid)tep,Size(st nepSize_ == 0 ? ncclShmem.comm.buffSizloces[NCCL_PROTO_SIMk(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E]/NCCL_STEPS/sizPLE]e/NCCL_oSTEPSf/siz(eof(TT) : s)tepSi ze_) : stepS{ i| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56| group(group: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: 254:90:note: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here254 | P rimitives, /*Direct=*/0, Proto, 0:63 | 670 > :P rim15iptiv:res, ProtoSimple<1, 1, 4>, 4>' requested here warning: initializer order does not match the declaration order [-Wreorder-ctor] 670565 | | tid(tid), nthreads(nthreads), tRedOp, iFanSydmmetrIic<1n>, 0,B Protol, 0> porimsc | ^ k/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558(:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested heret 558 | h rruneRing(tidp, nth(readsg, workr); | o ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hu:432:78:p note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here )432 , | runT ree UpDo| w n< ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T, R edO p, P| roitoSim tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_fple < 1, (1, COLtL671_UNiROLL>d, COL L_UNROLL>(tid, nthreads, work); | s| tep ^Size( stepS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hize_ == 0< su: btn)432? Run: Wor78nkCol:cl, 0, 2, 4>::run' requested herep, Ah melg432 | if (tid m<.comm .buffsSizesu[NCCL_bo, PPrRoto, COOLLt_TUNnROOLL)>_(). rSun(RunWorkColl, 1, 2, 2>::run' requested here : 12 | 56DEFINE:_nccl DevFunote: nc(in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | AllRe duce_ ROILL _UNNPrGR_OiLL>S(m).rIuitives, 0, _bf16_2, ncclFuncAllReduce, FProto, 0n(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ > prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | u ncrMiunMnaxR, ihinp_gbexpanded from macro 'DEFINE_ncclDevFunc'(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if ( t611 | i d Ru,r kalgCoo, lprloto<, Funnro,ll >(T).,ru n(R);e \ 12 warnings generated when compiling for host. d O| ^ p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h,:670 :15:A note: lfield 'nthreads' will be initialized after field 'tidInBlock' g o670, | Ptirdoto(tid),, COLL_UNROLL>().run(tid, su nbthtrenad,s(n thwreoadrs)k, )ti;dIn Bl o| c ^k(thr ea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cppdId:x.12x):1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncc,l gDroeupv(gFrouupn),c (| ^~~~~~~~~~~~~~~~~A /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hll:670R:60e: dnote: field 'group' will be initialized after field 'stepSize'u c670e | _ R ING_SIMPLE_MinMax_bf16_2, ncctild(Ftiud)n, cntAhrleladsR(nethdreuacdse),, ti dIFnBulonckc(tMhrieandIMdxa.xx),, g hip_bfloat16, NCCL_ALGO_RIroup(group), | ^~~~~~~~~~~ NG, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEF/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :INE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: ti670:15: dwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ( tid), nt h tid(trieads(nthreadd),s nthrea)ds, tid(InthreanBlds)o, tidck(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, wor/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Prtidi), nthrmeads(nithreadts), tiidInBlocvk(threeadIdx.sx), gr, /*Direct=*/ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/0,s Proito,z 0> perimso | ^f /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:(565:5:T note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here k); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 565 | ) : ste runTreeUpDown, COLL_UNROLL>(tid , | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupn /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:t63:56:h note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here r 63 | e Parimidtivess, )0, P;ro | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, workColl().run(tid,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1k):; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432 :78: note: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested heretid < su btn17 | DEFINE_ncclDevFunc(AllReduce_) RTunWorRkCollF().ruun(tnid, csubtAn,llReduce, FuncMinM work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkMPBLE_MainMatx_bfc16_4h, nccfloa,t16, NCCLa_ALGlO_go, proto, unroll>().rRINuG, NnCCL_(PROT)O_SI;MPLE, 4) \adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ti| ^d /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h(:t611:i62:d note: )expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, adsl),g toidI,nB lopckroto, unrol(lth>re(ad)Idx.x), gro.urupn(();group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' \ 670 | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670 :15: note: tfield 'nthreads' will be initialized after field 'tidInBlock' i d670 | ( t itidd()ti,d) , nnthtrehadrs(entahrds(nthreads), tidInBlock(threadIdx.x), group(egadrs)o, utpidIn)Block,(th r ea| dI ^~~~~~~~~~~dx .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hup), :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.co670 | tid(tid), mnthreadms(nthr.eads),b tidInBulock(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, fthreadfIdx.xS), groizes[NCCL_PRup(gOroup)T, | O_S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ IMPLE]/NCC | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSL_ShTEPS/msizeofe(T) : smtepSize._) { calgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupo /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hmm.:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested herebuffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 565 | runTreeUpDown, COLL_UNROLL>(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.htid, nthreads, wor:k670:15:) warning: initializer order does not match the declaration order [-Wreorder-ctor] ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78670: | tidnote: (tidin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here), 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: stin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereepS ize (stepSize17_ == | 0 ?D nccElShmFem.cIomm.NbuffESize_s[NCnCL_PcROTOc_SlDevFuncIM(PLE]A/NCClLlReduce_TREE_SIMPLE_MinM_STaEPS/xsize_of(Tb) : fstep1Size_6) {_4, ncclFun c | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ A | group(group l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303l:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hReduce, Fun90: cnote: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | :670M :15: i warning: initializer order does not match the declaration order [-Wreorder-ctor] nMax, hip_bfloat16, NCCL_ALGOPrim_itiTves, /o*Dirrect=k*/0, BProtao, 0t> pcrimsh | ^< /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hc : oll, ty, r565e:5: note: din instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here o565 | 671 | p < stet pSiyrze(>uste,npSiz Te_ ar==le 0 ge?o, proto, UpDouwn().ruOnp, (P)roto;SclS ihmem\m.c pomm. lbuf| efSi ^,d CsOL)L_,UN ROtLLi>(dtiId,n nBthlreoadcs,k w(ortk)h; r | e ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hSIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ a:432d:78I: dnote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herex .432 | xi)f (tid < subtn) RunWorkColl),( t)id.InrBluocnk(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | (DthEreFadIIdxN.xE),_ gnroucp(cgrlouDp)e, v | F ^~~~~~~~~~~ unc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16 stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> pri, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ms | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncAllReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx90a. [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t da.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grouIn file included from p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = uintIn file included from 64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work- >| ^~~ channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId -In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ onst int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7: warning: unused variable 'w' [-Wunused-variable] : 75 | 2 bar: rier_byIn file included from _group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h); | ^~~~~~~~~~~~~~~~~~: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:1511: note: expanded from macro 'barrier_by_group' : 29 | In file included from const i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hnt w =: th173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:r7eadIdx.: warning: unused variable 'w' [-Wunused-variable] 75 | x/ WARP_SIZE; \ | barrier_by_group( ^ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); : 29:15: | note: expanded from macro 'barrier_by_group' 29 | ^~~~~~~~~~~~~~~~~~ c onst in/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ht w = thre:adIdx.x29/WARP_SI:ZE; \ | ^ 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grou:366:15: warning: p(); | ^~~~~~~~~~~~~~~~~~unused variable 'bid' [-Wunused-variable] 366/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h | const: int29:15: note: expanded from macro 'barrier_by_group' 29 | cons btid = n cclShmeim.chnannetl w = threadIdx.x/WARP_SIZE; \ | ^ Id - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174R: P_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uSIZE; i\ | ^ nt32_tIn file included from data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data: warning: unused variable 'data2' [-Wunused-variable]2 145 | , uint 32_t dfata1, lflag1,a data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagg2; | 2 ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:;21: warning: unused variable 'flag1' [-Wunused-variable] 145 | | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_byIn file included from _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int wobrk->channeilLo; | ^~~ d = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.ch/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hannelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 28, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h::670432::1578:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | 670 | i f (ttiidd( to(u)p.)r,u n (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_s ubtn, wor k671) | ; | ^s tepSize(st/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cppe:p5S:i1z: enote: _in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here == 0 ? nc c5l | SDhEmFeImN.Ec_onmcmc.lbDuefvfFSuinzce(sA[lNlCRCeLd_uPcReO_TTOR_ESEI_MLPLL1E2]8/_NMCiCnLM_aSxT_EbPfS8/_s2i,z enocfc(lTF)u n:c AsltleRpeSdiuzcee_,) F{u n c| M ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i n M| a group(groupx , rccl_bfloat8/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h,: 254N:C90C:L _note: Ain instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereL GO_TREE ,254 | N C C L _ P RPOrTiOm_iLLt1i2v8e,s <2T), R| e^d Op, FanA/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hs:y611m:m62e:t rnote: iexpanded from macro 'DEFINE_ncclDevFunc'c c,h ,, 0a>l gpori,m sp r o| t ^o , unr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ho:l565l:>5(:) .note: rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereu n(); \ 565| | ^ runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ tepSize_ == 0 ? nccl671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.comm.buffSizes[NC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRCL_PROTO_SIMPLIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBE]l/NCCL_SToEPS/sizeocf(TOTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) : kstepSize_) ({ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303h:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303r | PreimitiadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ve?s, /*Deirect=m*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | ru/nTreeUp*Down, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | roto , CO LL_UrNROLuL>(n).ruTn(tireeUpDown | ^, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp :17:1C: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereO L17 | DEFLINE__ncclUDevFuNnc(RAllROeLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid ().run(_AtLGO_iTREdE, N,CCL_ PROTsO_SuIMPbLE, t4) n| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h,:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'w 611o | rk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_ M RuniWorknBatcMhf, al8go, _pro2to, u,nrol l>()n.runc(); c\ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:l670:15: Fnote: field 'nthreads' will be initialized after field 'tidInBlock' u670 | SnI ZEc), A | l ~~~~~~~~~~~~~~~~~~t l| i stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) R d507 | e ( dtwariupIndBlock)(thr,eadI dx.xn/WARtP_SIhZE),r | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e | warp(tid/WARP_SIZEa 508d | s fla(gThrnead(t(tidh%4)=r=3),e grouap(grdoup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/Nce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62s: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | ), t idInRBluock(nthrWeadoIdxr.x), groukp(gBrouap),t | c ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ h:670:<60: cnote: field 'group' will be initialized after field 'stepSize' o 670 | l ltid,(ti d),t ntyhre,ad s(nrthreeadds),o tipdInadI,d algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hx.x), group(group), | ^~~~~~~~~~~ :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSh/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWork tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | sBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncAllReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 22 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h7: warning: unused variable 'w' [-Wunused-variable] : 75 | 174barrier_b: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | y_g roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | barri conset ir_by_group()nt w ;= threadI dx| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:.x/WARP_ SIZE;note: expanded from macro 'barrier_by_group' \ | 29 | const in ^ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174a: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: 2unused variable 'data1' [-Wunused-variable] 145 | u,i flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, dnt32ata2, f_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_ | t uint 32_td data1a, fltag1, daata2, 1flag,2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:f35: warning: unused variable 'flag2' [-Wunused-variable] l ag145 | 1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: In file included from unused variable 'w' [-Wunused-variable] 80 | barrier/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: _In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hb:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 80 :5: warning: 29unused variable 'w' [-Wunused-variable] 80 | | b arrier _by_g ro const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ * ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelIchdannel Lo; -| ^~~ work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channe/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hlL:o366:15: ;warning: unused variable 'bid' [-Wunused-variable] | 366 | const ^~~ int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from 271:19: warning: unused variable 'ptr' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | 271 | ui nt64_t* pbarrier_by_group()t;r = rec vPtr(0 )+ll128| Offset; ^~~~~~~~~~~~~~~~~~ | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hh:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barriemem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int br_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ id =In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 366 | const int bid = In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.heads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < sub/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxIn file included from .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste p670S | i z e ( sttiedp(Stiizde)_, =n=t h0r e?a dnsc(cnltShhrmeeamd.sc)o,m mt.ibduIfnfBSliozceks([tNhCrCeLa_dPIRdOxT.Ox_)S,I MgPrLoEu]p/(NgCrCoLu_pS)T,E P S| / ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s i z| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_o f(T) : s671t | e p S i zset_e)p S{i z e| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s t e| p group(groupS ize_ == 0 ? ncclShmem.comm.buff/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hS:i254z:e90s:[ Nnote: Cin instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereC L_PROTO _254S | I M P L E ] /PNrCiCmLi_tSiTvEePsS, /*Dir/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.he:c254t:=90*:/ 0note: ,in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here Proto, 0254> | p r i m s P r| i ^m itives, ProtoSimple<1, 1, 2>, 2>' requested hereF anAsym m565e | t r i c ,, P/r*oDtiorSeicmtp=l*e/<01,, P1r,o tCoO,L L0_>U NpRrOiLmLs> , | C ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDoOwLnL<_TU,N RROeLdLO>p(,t iPdr,o tnotShirmepaldes<,1 ,w o1r,k )C;O L L| _ ^U NROLL>, COL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hL:_432U:N78R:O Lnote: Lin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here> (tid, n t432h | r e a d s , iwfo r(kt)i;d <| ^s ubtn) RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:r432k:C78o:l lnote: , 0, 2, 4>::run' requested hereF n, T, R e432d | O p , A l giof, (Ptriodt o<, sCuObLtLn_)U NRROuLnLWo>r(k)C.orluln<(Ftni,d ,T ,s uRbetdnO,p ,w oArlkg)o;, P| r ^o to, CO/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cppL:L7_:U1N:R Onote: Lin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested hereL >().run( t7i | dD,E FsIuNbEt_nn,c cwloDrekv)F;u n c| ( ^A llReduce/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp_:T17R:E1E:_ Snote: Iin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereM PLE_MinM a17x | _DfE1F6I_N2E,_ nnccccllDFeuvnFcuAnlcl(RAeldluRceed,u cFeu_nTcRMEiEn_MSaIxM,P LhEa_lMfi,n MNaCxC_Lf_1A6L_G4O,_ TnRcEcEl,F uNnCcCALl_lPRReOdTuOc_eS,I MFPuLnEc,M i2n)M a x| ,^ half, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hN:C611C:L62_:A Lnote: Gexpanded from macro 'DEFINE_ncclDevFunc'O _TREE ,611 | N C C L _RPuRnOWToOr_kSBIaMtPcLhE<,c o4l)l , | t^y , redop:,62 :a lnote: gexpanded from macro 'DEFINE_ncclDevFunc'o , pro t611o | , u n rRoulnl>(W).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.horkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4 tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrol l>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_2, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f16_4, ncclFuncAllReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp 22 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t :15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: y/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] ,77 | head, uint32_mantisst y, head,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ a; | ^ mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantis/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ sa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t da/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ha:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:15: warning: unused variable 'w' [-Wunused-variable] ,80 | barrierf_by_grolup(); a| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:g15: note: expanded from macro 'barrier_by_group' 129 | c,onst in t w = tdhreadIdax.x/WARtP_SIZE;a \ | ^ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from 145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:u175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gr5i: warning: unused variable 'w' [-Wunused-variable] 80oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = nthreadIdx.x/WARP_SIZE; \ | ^ t32_ | t data1,barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27: 15: warning: unused variable 'bid' [-Wunused-variable] 27 | cons29t int b | id = nc clShmem .channe lId - w ork->chcannelLoo; | ^~~ nst int flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t d aw = thrteadIdx.ax/WARP_1SIZE; \ | ^ , flag1, data2, flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 2182; | | ^~~~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2:218: :15: warning: In file included from unused variable 'bid' [-Wunused-variable] 218 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h c:onst i11nt bid: = nccIn file included from lShmem/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h.chann:elId -175 work-: >cha/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hnnelLo:; | ^~~ 271/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->cchannelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hannelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const intIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | expanded from macro 'barrier_by_group'b 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from t int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, da/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ ta2, fl ag2; | | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: ^warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr In file included from =/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75 barrier_by_group():7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uin tuint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid =/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11n: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hc:75:7: warning: unused variable 'w' [-Wunused-variable]c 75 | l barrier_Sby_grouph(); | ^~~~~~~~~~~~~~~~~~ m/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: eexpanded from macro 'barrier_by_group' 29 | m const i.nt w = thcreadIdx.hx/WARP_SaIZE; \ n| ^ nelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h28: warning: unused variable 'data2' [-Wunused-variable] 145 | :366:15u: warning: unused variable 'bid' [-Wunused-variable] 366 | i consnt int bidt = ncclS3hmem.chan2nelId _t data1, flag1, data2, flag2; - work->channelLo; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - wo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ rk->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from R/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable]15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] const int bid = ncc 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ lShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ntIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreahreads(nthreaddss(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid),), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_S nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, protso(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinIn file included from In file included from Max, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSizelo(ck(threst/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tiepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMadIdPx.x),L grouEp(grou]p), /| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hN:670:60:C note: field 'group' will be initialized after field 'stepSize' C670 | L tid(_tid),S nthrTeads(EnthrePads)S, tid/InBlosck(thrieadIzdx.x)e, groupdInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~ of(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(n nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreIn file included from ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | (ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421: 9 Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffS(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | izes[NCCL ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g:1070:r5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here o1070 | u runTrepeSplit<)T, RedO,p, Proto LL128, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, wo | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, 0, 1, 2>::run' requested here 5 | DENFINEC_ncclCDevFuLnc(All_ReducMe_TREAE_LL128_MinMax_f32_2, ncclFuncAl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] lReduce, FuncMinMax, fl o670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, flat, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBaX_DtEV_ARcITY>,h /*Di prlims ,| ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclS /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :565:5t: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested herey 565 | , hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ r unTreereUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78do:p, algonote: , proin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested hereto, u nroll >().run();432 \ | | ^ if (tid < subtn) RunWorkColloat, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TRE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x):670,:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] g670 | rtido(tid), nthrueads(pnthre(ads),g tidIrnBlocok(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | ru/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ nTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing warning: initializer order does not match the declaration order [-Wreorder-ctor] (670 | ttid(tiid), nthdreads,(nt nthreadshreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ?, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ncclS hmem.c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cppomm.buffSizes[NCCL:_PROTO12_SIMPL:E]/NC1CL_STE:PS/siz eof(T)note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here :12 ste | DEFINpSizeE_nccl_) {D | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~evFunc(AllReduce_RING_S | group(group I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63M:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here P 63 | L E_MinMax_ Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreadf32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, f note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1l:oat, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unro().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hd:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2oto: , 0In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: >In file included from pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:ims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | 173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g rrunTroeeUpDuownp, COLL| _UNRO tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_L 671 | stL>(etid,p nthrSeaidzse(stepSize_ == 0 ? nc, wocrk); l | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hS:432:h78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here m em.comm.buf432 | f S if (itizes[d ().run(tidpSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, 0, 2, 4>::run' requested hereX _DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5 :17 | D EFnote: INE_in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested herencc lDev Func(All565Redu | ce_T REE _SIMP LE_M inMarx_f3u2_4n, ncTclFurnceAllReeducUe, FpuncMDinMaxo, floatw, NCnCL_A, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().In file included from r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Size(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiz/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /t*Did(tid), nthreirect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COL L_UNRO LL>(t| id, n group(groupthrea ds, w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h::432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 63 432 | : 56if (:tid < subtnote: n) Runin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested hereWorkC oll ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_nc63 | c PlrimitDievFunc(Alves, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid , algo, proto, unroll>().run()Fn;, T, Re\dOp, Al go, | Prot ^o, C OLL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hUNROLL>(:).run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc670(:15: Anote: field 'nthreads' will be initialized after field 'tidInBlock' l 670l | R teid(tdid)u, ncthreead_RING_SIMPLE_MinMax_f32_4, ncclFunsc(nthreads)A, tlidIlnBlRock(ethrdeaudIcdx.ex),, group(gro up)F, u ncMinMax, float, NCCL_ALGO_RING| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group, NC(CL_gPROrTO_oSIMuPLEp, 4)) , | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :| 611 ^~~~~~~~~~~ :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hUNR:670O:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] L670 | tLid(ty>, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIn>(tid, nthreadid), nsthreads,Block(threadIdx.x), group(group), | ^~~~~~~~~~~ ( work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | nthre ads), ti dInBlock (threa dIdx.x),if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, ze_ ==d 0 ? nocclShmemu.comm.buffbSizes[lNCCL_PReOTO_SIM,PLE]/NCC L_STEPSN/sizeof(CT) : steCpSize_)L { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ _ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:A254:90: note: Lin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | G PriOmitives<_T, RedOTp, FanAsRymmetricE, /*Direct=*/0, ProtoMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:,15 0> pr:ims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5note: : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here field 'nthreads' will be initialized after field 'tidInBlock'565 | r unTreeU pDown,up(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNtROLL>(tiid, nthdre:670:15a:( warning: initializer order does not match the declaration order [-Wreorder-ctor] dt670 | s tiid(ti,d),d nt hre)adsw(nt,hreoads) , rtidInnBklockt(t)hrhea;dIdxr.x) , gerou p(graou| p), d | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_s 671 | (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h n stepSitze(stepSh:ize_ r432== 0 :? nceads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTiOze_) {_ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S | I group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hM:303:90:P note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereL 303E | , PrRiedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gmitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | ruroup), | ^~~~~~~~~~~ nTreeUpDown>, C,OLL _UNRaOLLl>(tgid,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ho nth,re adsp, wo:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grrko); t | ^o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h,:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereu n 432r | o l l>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work);ock(itf (htidr < esubatn)d RuInWodrkCoxll<. | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Fxn, )T, ,Red Op,g Alrgo,o Pruotop, C(OLLg_UNrROLoL>(u).rpun()tid,, su btn , w| or ^~~~~~~~~~~~~~~~~k); | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp::17:1670: note: :in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 6017 | D:E note: field 'group' will be initialized after field 'stepSize' FINE_ncclDevF670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), ,t | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 7 670 | | tDEFINEi_d(tid)n,cclDevFunc(AllReduce_TRE nthrEeads(n_threadsS), tiIdInBMPLE_MinlockM(threax_f64adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =_2,= ncclF uncAll0Redu ce, F?uncMin Max, dncclShmouble,e NCCL_mALGO_T.REE, NCcCL_PROoTO_mSIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run()m.b;uffSiz es[NCC\L_PROT O_SIMP LE]/NC| CL_STE ^PS/siz eof(T)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h : stepSize_) { | : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h670:254:90:: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 15254 | : P rimitinote: ves, nt,hrea ds(n/thre*ads)D, tiidIrect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5n:Blo ck(tnote: hreain instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested heredIdx .x) , group565(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId | x ru.nTrexeUpD), group(group), | ^~~~~~~~~~~ own, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReducIn file included from e, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.nthxreads)),, group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ tidInBlock(671thread | Idx.x) , gro up(g roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hs:670:60:t note: field 'group' will be initialized after field 'stepSize' epS 670ize(stepSize_ = | =tid(tid ), nth0 ? ncclShmem.comrmeads(n.threadsb), tuffSizes[NCCL_PROTO_SIMPLE]/NCCL_idISnBloTck(thEreadIdx.x), group(group), | ^~~~~~~~~~~ PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f32_2, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ OLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRed/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t:id(t670id),: nt15hr:ea warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIndsB(nthlreados), ctidIknBlo(ck(tthreahdIdxr.x),e graoup(dgrouIp), d | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ x| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ .671 | x), grou p st(epSigzeroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 4 , ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (671 | ste pSiz e_ = = 0 ? stepSize(stepSizncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCILdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _STEPS/sizeofe_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group( T) :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h stepSize_): { 63| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ :| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here56 :303 | note: Prin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereimit ives , 0, Proto, 0> pymmertrici<1, NmCCL_sMAX_ DEV_ ARIT| Y>, ^/*Di rect/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h=*/0, Pr:oto,558 0> :prim5s | : ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :565:5note: : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested herein instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 565 | runTreeUpDown(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl C,, CO LL_UTNROLL,>(t RedOp, Algo, Proto, COiLd, ntLhrea_ds, UworNk);R | ^O /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:L432:L78: note: >in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here (432 | ) .if (rtid u< sunbtn)( RuntWorkiColld().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFI note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double,NE _nccNlDevCFCL_ALGO_RINGun,c(Al lRedNuce_CTREE_CSIMPLLE_M_inMaPx_fR32_4O, ncTclFunOcAll_RedSucIe, FMuncMiPnMaLx, fEloa, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, alt, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrogo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ll>/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h().run(:); \670 | ^: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:60670:15:: note: field 'nthreads' will be initialized after field 'tidInBlock' note: field 'group' will be initialized after field 'stepSize' 670 | tid(670 | t itidd(ti)d),, nt hrenadst(nthhreradse), taidIdnBlsock((thnreatdIdhx.xr), egroaup(dgrousp)),, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ht:670:i60: dnote: field 'group' will be initialized after field 'stepSize' I 670n | B lotcidk(threadIdx.x),(tid), nthreads(nthreads), tidInB grloupo(grocupk), (| ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_2, ncclFuncAllReduce, FuncMiinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' d), nthreads(nthreads), tidInBlock(th 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_U/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllRedid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCuCce_TRELE_SIMPL_E_MinMaxS_f32_4T, ncclFEuncAllRPeduce, SFuncM/inMax, sfloat, iNCCL_ALGzO_TREEe,of(T) : st eNCCL_PpROTO_Size_) { SIMPL E, 4)| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread s 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthre( tid(ntid),t nthreahds(nthreraeds), tidInBlock(tads), htidInBrlock(tehreadIdx.xadIdx.)x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize_ == 0 ? ncclShmem.comm.buffSizesIn file included from [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) {/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | PrimitivesthreadI,dx.x), /*Direct=*/0, Protogroup(,group) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 0 | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ > prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:671 | 5stepSi:ze(ste pSize_note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown <== 0 ?T ncclS,hmem.c omm.buRffSizesedOp, ProtoSimple<[N1CCL_PR,OTO_SIMPLE]/NC 1, CCLO_STLL_UNROEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90:: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here7 254: | 1 P:rimi tivesnote: , 0, 2, 2>::run' requested hereRed Op, F anAsymme7tri | c, _/*Dinrectc=*/0c, Prolto, D0> perimsv | ^F /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:u565:5: nnote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here c565 | ( ruAnTreleUpDlown, duce_TREEC_OLL_UNROLSIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch( tidt, nythr,ead s, rworke); d| ^ o/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hp:432<:78t: note: yin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here >432 | , iaf (ltid g< sou,btn ) RpunWrorkoColtl CO(LL_)UNR.OLrun() ;670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here \ L>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthrenthreads(nthreads), tidInBlock(thrSIeMPLaE_MdinMIax_df64x_2, .nccxlF)un,cAl lRgeduce, FurncMionMaux, pdou(blge, rNoCup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), gCrL_AoLGOu_TRpEE,( NCgCL_rPROoTO_uSpIMP)LE,, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h| :611:62: ^~~~~~~~~~~note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h stepSi:670ze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCC:15L:_ warning: initializer order does not match the declaration order [-Wreorder-ctor]S T670 | E PtidS(tid/), nsthrieadzs(nethreoadsf), (tidTInB)loc k(th:rea dIdsx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizetepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid,LE ]/NCnCL_tSTEhPS/rsizeeofa(T) d: sstep,Size _) { | w ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ o| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hr:63:56: knote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here ) 63 | ; P ri| mit ^ive s,note: 0,in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here Pro to, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing().run(tidedO,p, Prosto, uCObLL_UtNROnLL>,(ti d, wnthoreards,k wo)rk);; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :| 432:78: ^ note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp432 | :if 22(ti:d <1 su:btn) Runote: nWin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereork Col 22 | DElF().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllnMax, float, NCCL_ALGO_RING, NRCeducCe_RING_SLIMP_LPROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkEB_MianMatx_fc64_h2, L, algo, proto, unrol_PRlOTO>_S(IMP)LE, .2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hr:611:un(); \ | ^ 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hRunW:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreadosrkBa(tchn, algo, proto, unroll>().run(); \ | ^reads), tidInBlock(threadIdx.x), group(group) ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h : 670:15| : note: field 'nthreads' will be initialized after field 'tidInBlock' ^~~~~~~~~~~~~~~~~ 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h ti:d(t670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),id) , ntthrieadds(ntIhrenadsB), tlidIonBlcockk(thr(eadtIdxh.x), grrouep(garoudp),I | d ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hx:670.:60: xnote: field 'group' will be initialized after field 'stepSize' ) , group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.houp(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiz/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Prim 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(itives, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DENROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ FINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthre| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ a | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ d671 | sstepSize(stepSize_ ==/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h 0 ? ncclShmem.comm.bu:f670:15,:f workS); | ^i /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hwarning: :z432initializer order does not match the declaration order [-Wreorder-ctor]:78e: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | 670 | tid(tid), i f (tidn < subttn) hreads(RunWornkthreads), tidInBlock(threadIdx.x)C,ollCCL_PgR(OTO_rS)IMPLoE.]up)run(/,NCCLt_ STEPiS /sized| of(T) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, : st epSi sze_)| u { tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_b| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :303671 | stepSize(stepSize_ == 0 ? ncclShn, mwoerk);m In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, 1, 2, 4>::run' requested here m22 | DE.FINEb_ncculDevFunc(AllRe:90d: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereu 303 | c e Pr_imitRivesL, /*EDire_ct=*Min/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: ffnote: Sizin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested herees[NCCdLOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ _P ROTO_S IMPLE]/565NCCL_STE | PS/s izeo f(T) : s tepMraxS_uf3i2_4,z nccelFun_cAll)Redu ce, {Func MinM ax, | fl ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primoait,t NCiCL_vALGeO_RIsNG,< NCTCL_,PRO TO_RSIMePLE,d 4)nOT pre ,eU| p^DoF wna/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h1 | ,, C O L0L _,U N RROPLuLr>n,o WCtOoLoLr_,UkNR BOL0aL>t>(tc idhp, nr, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing,, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | 670 | if ( tid < su btnt) RiunWdork(Coltl(tiLd, >nth(read)s, .worrk);u | n ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h(:432:t78:i note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested heredid ,) , nst432hur | ebad ts( nnt hre ads ), tidiInBfl ock((thtr, iewdaor dk), 0, 2, 2>::run' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ko Cu7op | Dl)l, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | ,E ^~~~~~~~~~~~~~~~~F INET/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h_n,c:c l670RedOp, Algo, Proto, C:O60: Lnote: field 'group' will be initialized after field 'stepSize' L 670_ | U tNid(RtidO), LnthLread>s(n(Dte)vhF.urnrce(uAalnldR(esdtu)cie,_dT R,EtE _iSsIdMuPILbEn_tMBinnlM,aox _fcw64ko_2(r, tknch)clr;Fue nca Ald| lRI ^edd ucx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp.e, Fxun:)c12M,:in1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEF IgroNup(Egro_up)n, c| ^~~~~~~~~~~Mc alx, DevFunc(AllRdouble, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,C alLgo,_ prALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' oto,611 un | rol l>( ).r un( ); R\ | u ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hn:670W:15o: note: rfield 'nthreads' will be initialized after field 'tidInBlock' k670 | B atidt(ticd),h nt, algo, ads), tipdInBrloock(tthroead,Id x.x)u, ngroroll>().run()up(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(; t\ i| ^ d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:)670:,15 nthreads(nthread: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),s) , gtidIrnBloocku(thprea(dIdx.x), ggrorup(gorouup),p | ) ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROT/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subt stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | :670: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tird), nthureads(nnthreadsT), tirdInBlocek(threeadIdx.Ux), gropup(groupD), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ w671 | stnepSize(s, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollTEPS/sizeof(T)( : st)epSiz.e_) {r | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ u | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn:63:56: note: (in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | t Pirimitives,b 0, tProto,n 0> p,rims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hw:558:5o: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here r558 | k r); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1un:Ring< T, Renote: dOp, in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h 17 | DEProFto, CIO:L670:N15:L warning: Einitializer order does not match the declaration order [-Wreorder-ctor] _ _670 | U tNid(tiRdOLL>(tid, nthreads, work)), n;threads(nthre ads), tid| InBlo ^ck(th readIdx.x), group(group)n,cclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:432m:78:m note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here. b432 | u f fSizes[ ifN (tCid ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cppL_P:ROT22O_S:IMP1LE]:/NC CL_note: STEin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested herePS/ siz eof(T22 | DEFINE_)n : cstecpSilze_D)evFunc {( | A ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | l group(group lReduce_RING_SIMPLE_Mi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn:M63:56a: note: xin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here _63 | f 3Pri2mit_ive4s, 0cA,llR eduPce, rFunocMitnMaox, , 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid,float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, nthrpeadrs, woortk);o | , ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :u432:78n: note: rin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here o432 | l l i>f ((tid) < .runsub(tn); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15) :Run Wornote: kCofield 'nthreads' will be initialized after field 'tidInBlock'll< Fn, T, Re670dOp | , A tid(tid),lg o, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SInMthrPeadLs(nEthr_eadMs),i tindInBMlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadaxI_f6d4_2x, n.cx), group(group), | ^~~~~~~~~~~ clFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ kBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grotepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpSDymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkB/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthrhreaedIdx.ax), grdoup(gsroup)), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | t stepiSdIinBlock(threadIdx.x), ze(gsteprSize_o == 0 u? nccplShme(group), m.co| mm.bu ^~~~~~~~~~~~~~~~~ff /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' Siz670es[NCC | L_PRO T tid(tid), nthreads(nO_SItMPLE]h/NCCLr_STEPSe/sizeaofds), t(T) :i stdIepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h runTreeUpDown, COLL_UNROLL>(tid, nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4,:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] n670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group:670(:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] g 670 | r tido(tid)u, nthrpceclF)auncA,dllRe sduce, ( Fun| ncM ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~intM ax,h | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.coreadsm), tmidInB.lockb(thrueadIdxf.x)f, groSup(grioup), z | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ e| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671ds | oubl[ e, NN CCL_C ALGOC_TRE Ls_E, tNPCCLe_RPROpOTO_SSTIMPLiOE, 4z)_ | eS^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hI:611:62:M( note: expanded from macro 'DEFINE_ncclDevFunc' Ps611 | Lt RuEenWor]pkBa/NCCL_STEPS/sSize_i =tch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zeof(T) = :0 ?/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncscltShememp.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCLSi_ze_S) {T | E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ P| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hS/:303:s90: inote: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here z 303e | o f Pr(imTitiv)es< T,: RedOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prim sste pSi ze_| ) { ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254::90: 565note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here : 2545 | : note: Priin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested heremit iv es, D/*ownDi1, COLL_UN RpriOms L | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl>, (CO)LL_.UNRrOLLu>(tnid, (nthtreaids,d wo,rk) ; s| ^ u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hb:432:t78: note: nin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here ,432 | w oif r(tikd < )sub;tn) Run Wo| rkC ^oll , 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(All Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncRecducle_TDReEE_vSIMFPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, dunc(ouble, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchAlMin,Max , adoulbleg, NCoCL_,ALG O_TpRErE, NoCCLt_PRoOTO,_SI MPLuE, n4) r | o^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hl:611:62: lnote: expanded from macro 'DEFINE_ncclDevFunc' > 611 | ( ).run/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFunc() RunWorkBatch, al; g\ o| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h,: 670:15:p note: field 'nthreads' will be initialized after field 'tidInBlock'r o670 | t otid,(tid ), unthnreards(onll>().ruthreads), tidInBlock(thrAellReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInn();B \ l | ^o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hc:670:k15: (note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t htidr(tied),a ntdhreIadsd(ntxhre.adxs)), t,idI nBlgockr(tho/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here reaudIdpx.x)(, ggroupr(grooupu), p| ^~~~~~~~~~~~~~~~~ )/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670,:60 : note: field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64 ti_d(tid), 4nthreads,(nthread s), tidInnBlock(threcadIclFuncAllReduce, dx.x),F group(ugroupncMinMax, double, NCCL_ALGO_RING, NC),C | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize__ 671 | P stRepSizOe(steTpSizeO_ ==_SI 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { MPLE , 4) | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nth:303:90:r note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here e303 | a Prdimitisvesg, /*rDireocup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:t=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f32_4, ncclFuncAllRMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]670 | tid(tid) , nth670rea | ds( nth reads) , t idtInBliocdk(t(hretadIidx.dx),) gro,up (gronup)t, h| ^~~~~~~~~~~~~~~~~ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:a60: dnote: field 'group' will be initialized after field 'stepSize' s 670 | ( n titd(thid)r, nethraeadds(ns), titeduce, FuncMinMax, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hdreadIsnBlo), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDow/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)In,Block( threadtIdx.x)i, groupd(groupI), | ^~~~~~~~~~~~~~~~~n /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60B: note: field 'group' will be initialized after field 'stepSize' l 670 | ock(t tid(thid), nrthreadseadId(ntxhr.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ eads), tid671InBloc | k(thre adIdx.x ), gro stepSizupe(group(), stepSize_ =| ^~~~~~~~~~~ = 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(ntp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f32_4, ncclFuncAllReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_2, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f64_4, ncclFuncAllReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx90a. [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | co/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ r = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: In file included from unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, f:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15:In file included from warning: unused variable 'bid' [-Wunused-variable] 218 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = nIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_In file included from SI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cppZ:E2;: In file included from \/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h : 11| : ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ cclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from 80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.In file included from channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree-In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Pro565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthroup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run()670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bu:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTr | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, workffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here670 | t1062i | d ( t i dr)u,n Rnitnhgrx(.txi)d,, gnrtohurpe(agdrso,u pw)o,r k )| ; ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h : 432 : 78s: tnote: ein instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested herepS ize(ste p432S | i z e _ = =i f0 (?t indc c)( ):. rsutne(ptSiizde, _s)u b{t n ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ w or| k group(group) ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 10 | DEFIN E303_ | n c c l D e vPFruinmci(tAilvleRseu,c e/,* DFiurneccMti=n*M/a0x,, Puriontto3,2 _0t>, pNrCiCmLs_ A L| G ^O _RING,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :N565C:C5L:_ Pnote: Rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereO TO_LL12 8565, | 2 ) r| u^n TreeUp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hD:o611w:n62<:T ,note: expanded from macro 'DEFINE_ncclDevFunc'R edOp, P611r | o t o S iRmupnlWe , rCeOdLoLp_O,L La>l(gtoi,d ,p rnottho,r euandrso,l lw>o(r)k.)r;u n (| ) ^; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidhreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.heads(nthreads:), ti670dInBloc:k(threa15dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAlIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nth/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here :78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hup(group:670:)15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PRO/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_2, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here: 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 17 | DEFINE_ncclDevFunc (670A | l l R ed utcied_(TtRiEdE)_,S InMtPhLrEe_aMdisn(Mnatxh_rue3a2d_s4),, ntcicdlIFnBulnocAclk(ltRherdeuacdeI,d xF.uxn)c,M ignrMoauxp,( gurionutp3)2,_ t ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N C C| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize__ ALGO_TREE ,671 | N C C L _sPtReOpTSOiz_eS(IsMtPeLpES,i z4e)_ =| =^ 0 ? nc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hc:l611S:h62m:e mnote: .expanded from macro 'DEFINE_ncclDevFunc'c omm.b uf611f | S i z e sR[uNnCWCoLr_kPBRaOtTcOh_i,z eaolfg(oT,) p:r osttoe,p Suinzreo_l)l {> ( )| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r u n| ( group(group) ; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here670 | tid( t303 | i d ) , nPtrihmreiatdisv(enstu,p )/,* D i| r ^~~~~~~~~~~~~~~~~e ct=/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h*:/6700:,60 :P rnote: ofield 'group' will be initialized after field 'stepSize't o, 0> p670ri | m s | t ^i d(tid),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :n565t:h5r: enote: ain instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested hered s(nthre a565d | s ) , triudnITnrBeleoUcpkD(otwhnr, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/go, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_2, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncx), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ntAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < su:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ btn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*D/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here irect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run()/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_f8_4, ncclFuncAllReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u32_4, ncclFuncAllReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 1818 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1= ncclShmem.channelId - work->channelLo; | ^~~ , flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | rIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, workunTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here edOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduceIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown:,d) ,C OnLtLh_rUeNaRdOsLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | t:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primit/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, protFuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_2, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthread: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u64_4, ncclFuncAllReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 1818 warnings generated when compiling for gfx1030. warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelIdIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | con - work-s>channelLto; | ^~~ int w = threadIdx.x/WARP_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hSIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:7: warning: :unused variable 'w' [-Wunused-variable] 75 | 75 b:arrier_b7y_group(:); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :29:15: note: expanded from macro 'barrier_by_group' warning: 29 | counused variable 'w' [-Wunused-variable]nst int w = threadI 75 | barriedx.xr_by_g/WARP_SIZE; \ | ^ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, fIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:1742_t data1, flag1, : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:f175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:l5: warning: unused variable 'w' [-Wunused-variable] 80 | barrag2; | ^~~~~ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const i /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hn:145:35: warning: tunused variable 'flag2' [-Wunused-variable] 145 | uint32w_t data1 , flag1=, threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from eadIdx.x/W/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:A27:15:RP_SIZE; \ warning: unused variable 'bid' [-Wunused-variable] 27 | con s| ^ t int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ em.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 28Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIs), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7roup), | ^~~~~~~~~~~ :1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) Runroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cppTREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hthreads(nthreads), t:i670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SdInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ of(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().ru ^n /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hid(tid), nthrea:670:ds(nthreads), tidInBlo15:c warning: initializer order does not match the declaration order [-Wreorder-ctor], 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k670 | ( tidt(tid)h, nthreadIrdeadsx(nthr.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp 670 | : 2tid(: tid)In file included from , nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hhrea:ds(11nthr: eadsIn file included from )/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h, tidInBlo:ck(t175hre: adI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hdx.x:), g508roup:(gro29up),: | ^~~~~~~~~~~ warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rimitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_In file included from PROTO_SIMPLE,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:112: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: )/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize:611:62: note: (eexpanded from macro 'DEFINE_ncclDevFunc'm.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s611 | RtunWoeprkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hize_ == 0 :? ncc670lShme:m.comm60.buff:Sizes[ NCCL_note: PROTOfield 'group' will be initialized after field 'stepSize'_SIMP LE]/N CCL_STEPS/size670of(T) | : st epSiz e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hi:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ > prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(th611 | readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 R | unWo rkBa tch< coll , ty,s tredoep, alSgo, iprotze(stepSize_ =o,= un roll0>() .r? ncclShmem.comm.buffSizes[NunC();C \ L | ^_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hP:R670:15O: note: Tfield 'nthreads' will be initialized after field 'tidInBlock' O670 | _SIMPLE] / tiNd(tCidC), Ln_thrSeads(nTthrEeadsP), Stid/InBslocik(tzhreadIdx.xe), ogroup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthrf(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto,e ads0(nt>hre adsp), rtidiInBlmocks(th rea dId| x.x ^), gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.houp:(gr558oup:), 5 | ^~~~~~~~~~~ : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadNCICL_PROdTO_SIMxPLE]/N.CCL_STEPxS/size)of(T) ,: st group(groupepSize)_) { ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_note: 671 | in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here ste pSiz e(stepSize_ 565== 0 | ? nccl Shmem. comm.b runTuffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | PrireeUmpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subAX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COtLn, wLork)_; | U ^ N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17R:1O: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereL 17L | DEFINE_>nccl(DevFutnc(AillRedduce,_T REE_nSIMPtLE_hMinMrax_u8e_4,a ds, work)n;ccl Func AllR| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: edunote: ce, in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereFunc Min 432 | if (tiMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Rud < nsubtWn) RounWorrkColkl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1atch<: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:NE_ncclDevcoFll,u tyn, rcedo(pR, aelgo, pdrotuo,c uneroll_>()T.ruRn(E); \E | _ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSI:670:15M: note: Pfield 'nthreads' will be initialized after field 'tidInBlock'LE_MinM a 670x | _ tui8_4, ncclFuncAllReduce, Fud(ntidc), Mnthireands(Mnthareaxds),, t iduInBilocnkt8_t(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(n, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run()t hnote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ;r ead\s), tid In| Blo ^ck( th/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hreadIdx.:x),670 gr:oup15(gr:oup ), note: | field 'nthreads' will be initialized after field 'tidInBlock' ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiz:670:15:e warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | s tid(t[id), nthNreads(ntChreads),C tidInBloLck(threa_dIdx.x),P /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hROTO_SIMPLE]/NCCLgro_up(grouSp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~TEP | :S/si670zeo tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_f :671 | (15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tiT) : dstepSize_() { t| stepSizi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e(stepSd ize_), nth | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254 == 0 ?: ncc90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | lSh mem.comPm.buffSriimitives, /*DSirect=*//sizeo0, Proto, 0> fprims | ^( /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5T: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here )565 | ru nTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here i303 | z PrimitieveLs, /*Direct=*/0, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h>, COLL_UNROL LProto>, 0> (primst | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hid, nthreads, wor:565k:5: note: )in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565; | runTr e| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if eUp(Down, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> pd , ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNROLL>().run(tid, subt COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadp(gsroup),( | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: 0 initializer order does not match the declaration order [-Wreorder-ctor]? n 670 | c clShmtem.coimm.budffSize(s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> ptrid),i nthrmeads(snthrea ds ), ti| dInBl ^ock(th readI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hdx.x), grou:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn303,:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herew o303 | r k )Pri;mit ive s, 1, 2, 2>::run' requested here 12 | DEFINE_ncclXD_DeEV_vARFITY>u, n/*Dcirect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown(, COLL_UNROLL>8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ (t| id ^, n threads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl:670:15(: note: )field 'nthreads' will be initialized after field 'tidInBlock' .670 | r u tind(t(id), nthtreaids(dnt, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here hr eads), 17tid | InBDlockE(thFreaIdIdxN.x)E, g_roupn(grcoupc), l DevFunc(AllReduce_TREE_SIMPLE_| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch670, | a tilgo, d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ro t| o, tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_unro ll> ().run671(); | \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock's t670 | e tpidS(tiid)z, nethr(eadss(ntthreeadsp), StidiInBzloe_ == c0k(t hre?ad ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' STEPS/s670 | tid(tid), nthreads(nthreads), tidInBlock(tizehof(Tr)eadIdx.x), gr : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_2, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_minmax_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_MinMax_u8_4, ncclFuncAllReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp ^~~~~~~~~~~~~~~~~~:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: warning: unused variable 'w' [-Wunused-variable] 75 | b:arrier_b29y_group(); : | ^~~~~~~~~~~~~~~~~~ 15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: :expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hnote: expanded from macro 'barrier_by_group' 29 | 29 | c const oint w = thnreadIdx.xs/WARP_SItZE; \ | ^ int w = thIn file included from r:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ eadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : barr2ier_b: y_groIn file included from up(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::29:15: 11note: expanded from macro 'barrier_by_group' 29: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h | : 80:co5n: warning: unused variable 'w' [-Wunused-variable] 80 | bst inat w = rthreadrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threa; | ^~~ dIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7In file included from : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->In file included from chann/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2e: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11l: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: L/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14o: warning: unused variable 'data1' [-Wunused-variable] 145 | ui;nt32_ t da ta1, f| lag1 ^~~, da ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h145 | uint32_t data1, flag1, data2, flag2; | ^~~~~:366:15 : warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;In file included from \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | ui/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtnt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagr(20)+ll;128Of fset; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(In file included from 0)+ll128Offset; | ^~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const in/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:t2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:b175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hi:80:5:d warning: unused variable 'w' [-Wunused-variable] 80 | = ncclShmem.channelId - work->channelLo; | b ^~~arri er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: 15In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h :271:19:warning: warning: unused variable 'ptr' [-Wunused-variable] unused variable 'bid' [-Wunused-variable]271 | uint64_218t* ptr | = re cvPtr (0)+l l128O ffsetc; | ^~~o nst int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from 366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo c565 | rk(threadIdx.x)unTre,eUpDown< group(group), T, RedOp, | ProtoSimpl ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e<1, 1, COLL_UNR OLL>, COL| L_UNROLL>(t tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_id, nt 671 | hreads , work) ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested heres 432 | t if (tied < subtpn) RunWorSkCize(stepSize_ =oll(_).run(t)id, subtn,{ work) ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:| 7:1: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | D | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here EFINE_ncclD254evFunc | (All Reduce _TREE_ SIMPLE _PreMu l PrimiSum_bft16ives, /*Directfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) =*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611C:62: note: expanded from macro 'DEFINE_ncclDevFunc'O 611 | LL_UNRO L RunWLorkB>atch,d algo,, p roton, unrtoll>(h).run(r); \e | ^ a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:d15:s, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < sub tnote: field 'nthreads' will be initialized after field 'tidInBlock' n670 | ) tid( tid),R nthrueads(nntWhreaods), rtidInkBlock(Ctholl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRedu note: field 'group' will be initialized after field 'stepSize'c e670 | _ TtidR(tiEd),E nt_hreSadsI(ntMhrePadsL), tEid_InBlPockr(thereaMulSum_bf16_2, ncclFuncAdIldx.lx),R geroudp(guroucp),e | , ^~~~~~~~~~~ FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*In file included from Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 670:15: warning: rinitializer order does not match the declaration order [-Wreorder-ctor] 670 | unTreeUpDowndx, COLL_U.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | NROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl st().run(t_) i{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d , subt| group(groupn /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h,: work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_254:90: nnote: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | c cPrimiltives,llReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO /*_DireSct=*I/0, MProtPo, 0> prLims E | , ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h 2:565:5): note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | | r^unTr eeUp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hDown, C611 | OLL _U RunWorkBatcNRhOLL><(tidc, ntohrll, ty, redop, algo, proeatds, owork,); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78u: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here n 432 | r if (tid < subtn) RunWoroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthrekCoall(a)d.rsun()tid,, subttn,i wordk);I | n ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cppB:7l:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested hereo c7 | DEkFIN(E_ntcclhDevrFuneca(AlldRedIuced_TRExE_S.IMPxLE_)Pr,eMu lSugm_brf16o_2,u ncpclF(uncgAllRreduocup), e, FuncPreMulSum, hip_bfloat16, | ^~~~~~~~~~~ NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPSf (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :a670:15: warning: dinitializer order does not match the declaration order [-Wreorder-ctor] 670 | I dtid(tixd), .nthrexads(nthrea)ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s,t groeup(gropup), S| ^~~~~~~~~~~ ize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, al/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread:670s:15: warning: )initializer order does not match the declaration order [-Wreorder-ctor] ,670 | titd(tiid)d, ntIhreands(Block(threadIdx.x), group(gnthrreads)o, tiudInBplock()thre,adId x.x) ,| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), n grotup(hgroupr), e| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | a tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ d671 | s stepSize((stenpSthreads), tiidze_ I== 0 n? nBcclock(threadlShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NIdx.Cx), CgrouLp(gr_oupSTEPS/sizeof(T) : ste), | ^~~~~~~~~~~ pSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unro| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlockIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSiz(threadIdx.x), group(group), | ^~~~~~~~~~~ e_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TRE if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf1E, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 6_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduo, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hCCL_ALGO_TREE, NCCL_PROTO_:670:15: warning: SIMPinitializer order does not match the declaration order [-Wreorder-ctor] LE670 | , 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h: t611id(ti:d), 62n: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RutnhreadWs(nthoreadsr), kBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.p, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grcooup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreammd.buffsSizes[NCCL_PROTO(_SIMPLE]/NCnCL_StTEPS/hsizeorf(T) :e stepaSize_d) { s| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ) group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h,:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here t254 | i PrimditivIes, /*Direct=*/0, Proto, 0> ck(threpadIrdx.x)i, groump(grosup), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPL:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWork/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGnBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthFranAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidIn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrecclDaevFuncd(AllResduce_T(REE_SnIMPLE_tPreMulhSum_bf1r6_4, nceclFuncaAllRedds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671uce, | Func PreMul Sum, h ip_b float1s6, NCCtL_ALeGO_TREpE, NCCSL_PROTiOze(st_SIMPeLEpS, ize_ == 0 ? nc4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hc:611:l62: note: expanded from macro 'DEFINE_ncclDevFunc'Shmem.comm.buffSizes[NC 611 | RunWorkBatch, CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: algnote: o, protin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereo, un roll>( ).run(); \ | ^ 303/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670: | 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid (tid), n Prithmreads(itnives ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, /*D:670i:60:rect=*/0, Proto, 0> note: field 'group' will be initialized after field 'stepSize' p 670 | r tiid(timd)s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h, nthr:eads(565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here nthreads), tidInBlock(threa565 | runTreeUpDown | ^~~~~~~~~~~ , COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, ProtCOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 432 | if (t id < sub| tn) Ruo, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ nWor ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~kColl().run(tid, subtn, work); 671 | stepSize(stepS | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cppi:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINze_ == 0 ? ncclShmemE_ncc.lDevFuncc(AllReducomm.buffSizes[NCCL_PROTO_SIMPLEe_RING_SIMPLE_PreMulSum_bf16_4, ncclF]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here un cAllReduce,63 Func | PreMulS um, hi p_bflo at16, N CCL_ALPGO_RINGr, NCCLi_Pmitives, 0, Proto, 0> primRsOTO_SI MPLE, 4) | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h: ^611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h RunWorkBatc:h, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(ne5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid,dop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nt nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSLL>().run(tid, subtn, awds),o tidrInBlkock()thre;adId x.x) , | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prgrouep(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),Mul Sum_tbf16i_4, ndcclFIuncAnllBRedulce, oFuncPcreMuklSum(, hitp_bfhloatr16, eNCCL_aALGOd_um, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tiRINIG, NdCCL_xPR.OTOx_SIM)PLE, ,4) | d^(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ g /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:r611:62: onote: expanded from macro 'DEFINE_ncclDevFunc' up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 611 | RunWorkBatch, algo, proto, unroll>().run(); \ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, P| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hr:670o:15: tnote: field 'nthreads' will be initialized after field 'tidInBlock' o670 | , ti d(ti0d), >nthr eadsp(nthrreadis), tmidInsBloc k(th read| Idx. ^x) , group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nth/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: rein instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(432g:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s:t670:15: ewarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | p tidS(tid), nithreads(znthreadse), tidInB_lock(thr)ea dI{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | d x.x), grou p(group) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ Primitives, 0, Proto, lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | stepS ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254i:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here f254 | (tid < subtn) RunWorkColl, /*Dirtect=*o/0,, Prot o, 0COLL_UNROLL>().r> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING,m ple, PCOLLR_UNROOLL>T(tidO, nth_reaSdsI, woMrk);P | ^ L/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:E432:78:, note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 4324 | if) (ti d < subtn) RunW| orkC^oll< Fn,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work);:611:62 : note: expanded from macro 'DEFINE_ncclDevFunc' 611| | ^ Ru nWor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cppkBatch, 0, 2, 4>::run' requested here redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncAllReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1030. 22 warnings generated when compiling for gfx90a. [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75::7: warning: unused variable 'w' [-Wunused-variable] warning: unused variable 'flag1' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15 : note: expanded from macro 'barrier_by_group' 29145 | | c uint32_t donst int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h 29 | :const 366int w := thre15adI:dx.x/W ARP_SIwarning: ZE; \ unused variable 'bid' [-Wunused-variable] | ^ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bar/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cppr:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:i27:15: warning: unused variable 'bid' [-Wunused-variable]e 27 | r co_nst bint bidy = ncc_lShmem.cghannelrId o- work-u>channpelLo; ( | ^~~ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:14:: warning: unused variable 'data1' [-Wunused-variable] 15145 | : uint3 2_t data1, flag1, warning: dataunused variable 'bid' [-Wunused-variable]2 , f218l | a g 2 ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 const int bid = ncclShmem.channelId - work-:21: >warning: unused variable 'flag1' [-Wunused-variable] c145 | h uint32a_t dnata1n, flaeg1, dalta2,L flag2;o; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] readIdx.x/WARP_SIZE; \ | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = co nst intn bid = cncclShmecm.channlelId - Shmemw.ork->chcahnannelInelLdo ;- | w ^~~ork->chann elLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h::11218: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174:: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h::366:151575: warning: unused variable 'bid' [-Wunused-variable] :: 7366 | warning: c:onstunused variable 'bid' [-Wunused-variable] in t warning: 218 | unused variable 'w' [-Wunused-variable] 75c | on bs id = tnc cl iS hn mt bibarrdiem.chae = ncclr_Sby_grouhpmem.channelId(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h-:29:nne15 lId - w:work->coh annelrLnote: o; | ^~~kexpanded from macro 'barrier_by_group' - >c29h | annelLo ; cons t i| n ^~~t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::366:15174: warning: : unused variable 'bid' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h : 145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groupx/W(ARP_SI)ZE; \ ;| ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const intIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | co/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hnst int w = th:366:r15: warning: unused variable 'bid' [-Wunused-variable] e366 | caonst indt bid = IncclShmdem.chxannelId. - worxk->cha/nnelLo;W | ^~~ ARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nth RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670: | 670: tid(tid)15: warning: ,initializer order does not match the declaration order [-Wreorder-ctor] 670 | nthr tieads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm., group(group), buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here | ^~~~~~~~~~~ 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreathreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 18 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ = 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here a 432 | d if (tid < subtn)I RunWorkdColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bflo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ at8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | : 670 tid(tid:), nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group s(nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.he:a63d:s56),: tnote: iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested hered InBlock (63t | h r e a dPIdrxi.mxi)t,i vgerso, 0 ,671 | P r o t os,t e0p>S ipzrei(msst e p| S ^i ze_ == /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h0: 558?: 5nc:c lnote: Sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested hereh mem.c o558 | m m . bu frfuSnRiiznegs<[TN,C CLR_ePdROOpT, OP_rSoItMoP,L EC]O/NLCLC_LU_NSROTLELP>S(/tsiidz,e onft(hTr)e a: dsst,e pwSorikz)e;_ ) | { ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ht:i303d: 90<: snote: uin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereb tn) RunWor k303C | o l l < F n ,P rTi,m iRteidvOeps<,T ,A lRgoe,dO pP,r otFoa,n AsCyOmLmLe_tUrNiRcO (N)C.CrLu_nM(AtXi_dD,E Vs_uAbRtInT, Yw>,o r/k*)D;i r e| c ^t =*/0, Proto,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp 0:>22 :p1r:i mnote: sin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h22: | 565D:E5F:I Nnote: Ein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here_ ncclDe v565F | u n c( A lrluRneTrdeuecUep_DRoIwNnG<_TS,I MRPeLdEOp_,P rPerMoultoSuSimm_bpfle8<_14,, 1n,c cCOlLFLu_nUcNARlOlLRLe>d,u cCeO,L FLu_nUNcRPOrLeLM>ul(Stiudm,, nrtchcrle_abdfsl,o awto8r,k );N C C| L ^_ ALGO_RING, N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hC:C432:L78_:P Rnote: Oin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereT O_SIMPL E432, | 4 ) | ^i f (tid /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h<: 611su:b62t:n) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduceeads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? n, cclShmem.FcuncPreMulSumomm.buffSizes[NCCIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ , L_Prccl_bflROTO_SIMPLE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here oat8, NCCL_ALGO_TREE , NC432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here CL_PROTO_SIMPLE, ]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | 4) if (ti| d < subt^n) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h Ru 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch< note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ coll, ty, redopl()go, pr.orun(tito, unroll>d(, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670::670:15: | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid),= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group nthreads(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, wo | , tid(ti drk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFun), ntthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(c(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:threadIdx.x), group(group62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkB)atch, algo, proto, , | ^~~~~~~~~~~~~~~~~unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threaizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitiveso,, /C*ODLiLr_eUcNtR=O*L/L0>,( )P.rroutno(,t i0d>, psruibmtsn , | w ^o rk); | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp : 22 : 1ru:n Tnote: rin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested heree eUpDown_,S ICMOPLLLE__UPNrReOMLuL>lS(utmi_db, f8n_t4hr, enacdscl,F uwoncrAkl);lR e d| u ^ ce, FuncPreMu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hl:S432u:m78,: rnote: cin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herecl _bfloa t8432, | N C C L_ AiLfG O(_tRiIdN G<, s NubCtCnL)_ PRRuOnTWOo_rSkICMoPlLlE<,F n4,) T ,| ^R edOp, Alg/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:,611 :P62r:o tnote: oexpanded from macro 'DEFINE_ncclDevFunc', COLL _611U | N R O L LR>u(n)W.orruknB(atticdh,< csoulblt, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_In file included from t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t datIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: 145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: int w =expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fla:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hg2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2w: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from o/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271r:19: warning: unused variable 'ptr' [-Wunused-variable] k 271 | - uin>t64_t* cptr = rhecvPtr(a0)+ll128nOffset;n | ^~~ elLo; | ^~~ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work-In file included from >channelLo; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_gro:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h In file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclSIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:hmem.channelId - work->channelLo; | ^~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ elId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(sten, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatpSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Protch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(toi, 0> pridms | ^ )/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:, note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | nthreads(nthreads), ti runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSi303:90: znote: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | e Pri(mitives, /*Dizrect=*/0e,_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tidROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Di, nthrreads, weork); c | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ht:432:78:= note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here * 432 | / if0 (tid ,< subtn ) RunWoPrkCollr prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here dOp, Alg565o, Pro | to, CO LL_UNR OLL>() .run( runTretid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | eUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (PtreMuilSumd_f16 _2, | ^ (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:)62: note: expanded from macro 'DEFINE_ncclDevFunc'.run(tid, su b 611 | t RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0n, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_S | I MtidP(tiLd),E nt,hreads(nt hre2ads)), tid InBlock(| thr^ead Id/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hx.x), g:r611oup:(gr62oup:) ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | D/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthrea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCol:670:15:l warning: initializer order does not match the declaration order [-Wreorder-ctor] <670 | tiFd(tid),n nthrea,ds(nthr eads), TtidInBl,ock(thr eRedOadpIdx, Algo.x,), gro Proto, COLL_UNROLL>()up.(group)r, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ u | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | n (tid, sstepSiubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cppze(stepS:ize_ 7== 0 ?:1: n cnote: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFIclNShmem.comEm.b_uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_eSpSizIe_) { M | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ P | group(group L/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:E63:56,: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | 2 Pr)imitiv es /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkB, a0, Prtoto,c 0> prhims <| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hc:558:5:o note: ll, ty, rein instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested hered 558 | o rpun, aRling(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tunroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllRedIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:d(tid), nt2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670L_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBat tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h), nthrea:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDev/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PRFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIty, redop, algo, protonBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hroup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] )670 | , tid(t id | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hf(T) : s:670:15:t warning: initializer order does not match the declaration order [-Wreorder-ctor] 670e | tid(ptid), nthrSeads(nthrieads), tizdI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nBl ock(th | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here readId x.x), group(gro63 | Priup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =mitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here ng, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78:= 0 ? ncnote: clShmin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereem.comm .buffSi zes[NCCL_PROT432O_SIMP | LE]/NC CL_STEP S/sizeof (T) : s tepSize _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hi:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested heref 254 | Pri(mitivest, /*Direct=*/0, Proto, 0> priRedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, halms f| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h,:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested hereN 565 | C ruCnTreeUpDown, CIOLL_NUNROLGL>(tid, nthreads,, wor k); N| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hC:CL432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(UtNROLhL>()r.rune(taid, dsubtIn, wdork);x | . ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cppx:17:1): note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here, 17 | DEFgINE_rnccloDevFuunc(pAllR(educge_TrREE_oSIMPuLE_PpreMul)S, | um_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads)note: ,expanded from macro 'DEFINE_ncclDevFunc' 611 | t i RudnWoIrkBnatcBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ h, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nth/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTOork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlo, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < ck(threadIdx.x), group(group), | ^~~~~~~~~~~ subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_2, ncclFuncAl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreadlReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROSTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, workTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432 :78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | 565 | if (ti d < su btn) R unWork ColleUpDow().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIn, CSOLL_uUNROmLL>(_tid,f nth1read6s, w_ork)2; ,| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hn:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here cclF432 | if (tid < subtn) RunWorkColl().run(tuncAllReduce, FunicPredMulS,um, half, NCCsL_ALuGO_RbING, tNCCLn_PRO,TO_S IMPLEw, 2)o | ^r /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62:k note: expanded from macro 'DEFINE_ncclDevFunc' )611 | ; R unWorkBat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllRedch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthread | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: s(note: nthexpanded from macro 'DEFINE_ncclDevFunc'rea d s), ti611d | InB loc k(t hre adIRdx.xu), ngroWup(ogrorup),k | B ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ha:670t:60:c note: field 'group' will be initialized after field 'stepSize'h 670< | uce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c toid(ltidl), ,nth reatdsy, redo(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().rugroup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncidPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ %WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWork/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFunRedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMulSum, hcAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ alf, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().rlShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMuepSilze_) S{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u | group(group m/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:_63:56: note: fin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 631 | Prim6_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWoitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432rkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(n 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl()threads), tidInBlock(threadIdx.x), group | ( g rif o(tiud

().670run | (ti d, s ub tn, wortk);i | d ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp(:t12:1i: note: din instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here )12 | D,EFI NE_nn.run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cctlDehvFrunec(aAlldRse(nthreduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncAllReduce, FuncPreMads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nth 0 ? ncclrShmem.comm.bueffSizes[NCCLa_PROTO_SIMPLdE]/NCCL_STEsPS/sizeof(T) ): stepSize_), { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h ti:63dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Pr 671 | stepSize(stepSize_ ==imitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize({ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(grouptid, nthre ads, work/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here :303:90: note: 432 | in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | if (ti Primidt < ives, /*Direct=*/0,, Prot o, COLLP_UNROLrL>().orun(titd, subotn, w,ork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DE 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tiFIdNE_ncc,lDevFunc(AllReduc e_RINGn_SIMPLtE_PreMhulSreads, woumr_f16_k4, nc)clF;uncAl lRed uce, | FuncP ^reMu lSum, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hhalf, NCCL_A:LGO_R432ING, :N78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here CCL _PR432 | if (tid < subtn) RuOTO_SIMPLE,n 4) | W^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:o611:62:r note: expanded from macro 'DEFINE_ncclDevFunc' k611 | Coll, algoedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDev,F protou, unrnoll>c().run(();AllRedu \c | e ^ _TREE_SIMPLE_PreMulSum_f16_4, ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hl:670:15F: unote: field 'nthreads' will be initialized after field 'tidInBlock' n670 | c tAid(tlid), lnthrReadse(nthdreuads)c, tiedInBl,o ck(FthreaudIdxn.x),c groPup(greMulSum, half, NCCL_ALGO_TRrouEp), E | ^~~~~~~~~~~~~~~~~ ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670: 60: note: Nfield 'group' will be initialized after field 'stepSize' C670 | C tiLd(ti_d), PnthrReadsO(nthreads), tidInBlock(threadIdTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); x.\x), gr oup| (gr ^oup ), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^~~~~~~~~~~ :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncAllReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:14580:14: warning: unused variable 'data1' [-Wunused-variable]: 1455 | :uint3 2_t warning: data1unused variable 'w' [-Wunused-variable], fla g1, data2, fla80g2; | | ^~~~~ barrier_by_gr /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:o145:21: uwarning: unused variable 'flag1' [-Wunused-variable] 145 | p u(int32)_t da;ta1, flag1 , dat| a2, ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uinIn file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: 3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: 2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: _/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:t7: warning: unused variable 'w' [-Wunused-variable] 75 | d acon stt ibnat wa 1= tr,hreadr Idflag1, data2, flag2; ier| _by_g ^~~~~roup( ); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:: note: expanded from macro 'barrier_by_group'x145. x/: WAR35P_S:29IZE; | \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cppwarning: unused variable 'flag2' [-Wunused-variable] :145 | 2 uin: t32_In file included from t dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ha1, f:lag1,11 data: 2, fIn file included from lag2;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h | ^~~~~ :175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: 2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: _In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.ht: data1, flag1, data2, flag2271:;19: warning: unused variable 'ptr' [-Wunused-variable] 271 | | ^~~~~ uint 64_t*/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h ptr =:145:35: warning: unused variable 'flag2' [-Wunused-variable] recvPtr(0)+ll128Offset; 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~| ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclIn file included from Shmem.channelId - work->cha/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2n: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn:11: In file included from e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175l: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:Lo; | ^~~ 80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid218 | = co nstn inct bcid l= nScclhShmmem.echIn file included from m.channealnnIelIdd - wo-rk- >chwannoelLro; k| ^~~ ->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.htr(0)+ll128Offset; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | consIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | consIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ cclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hi:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGOIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ In file included from | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0,: Proto, 0>670 p:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670r | ims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: ti565:5: dnote: (tid), nthreads(nthreadsin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here) 565 | , runTr eeUpDowtn tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_, COLL_ UNROLL> (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cppt:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11i: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h671:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hd: | 670:15: warning: , initializer order does not match the declaration order [-Wreorder-ctor] 670 | tin d(tid) t, nthrhseads(tnthreades), rtipdInBeSlock(tihreadIzadx.x), edgroup((sgrous,p), | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_epSiwork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ z 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizee_ s== 0 [? ncclNShmemC.commC.buffSizes[NCCL_PLROTO__SIMPPLE]/NCCL_STEPS/siROTO_SIzeof(MT) PLE]/NCC: stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here T 254 | Y > Prim,itiv esymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTr eprimes | U ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hp:565:5:D note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here o 565 | w rnunTr, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tipd, Pro toSi, nCOLL)_UNRO LL>R(tid,u nthnreadWs, woork)r; | k ^Coll, 0, 2, 2>::run' requested here 432A | l go, Proto, CO if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1LL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2: ,note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here n7 | cDEFcINEl_ncFclDuevnFcunAc(AlllRleduRce_eTREdEu_ce, FuncPrSeIMPMLEu_PrelMulSSuumm_, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2f)32 _2 , | nc^c l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_:611P:62R: Onote: expanded from macro 'DEFINE_ncclDevFunc' T O611 | _ S IRuMnWPorLkBEatc,hR, ualngo, proWtoo, runkroBll>a()t.rucn(ha, aldgs)o,, ptriodtoI, nuBnrlolol>c()k.r(unt()h; r\ e | ^a /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hd:670:I15:d note: field 'nthreads' will be initialized after field 'tidInBlock' x 670. | x ) t,id (tgid), rntoup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | hreads(nthreads), tidInBlock(threadIdx.x), group(group), tid| (t ^~~~~~~~~~~~~~~~~i d)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, nt:hr670ea:d60s(:nt hrnote: eafield 'group' will be initialized after field 'stepSize'ds ), tidInBl670oc | k( th re adI dx.x),tid(tid), nthreads(nthread group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste.x), pgroup(grSoup), | i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | z stepSeize(step(Size_ ==s 0 ? nccltShmem.coemm.buffSipzes[NCCL_SPROize_ == 0 ? ncclShmem.comm.buffSTiO_SIMPLEz]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | c<1 ru>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, ntPrnTreoeUpDowntL, COL_L_UNRUOLL>(Ntid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: Rnote: OLL>(in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here).r un( tid, s432ubtn, | wo rk); if (tid | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp :12:< subtn) RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Algo, Pr1o: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested heret o, COLL_UNROLL>().run(tid, 12 | sDEFuINEb_nctclDnevF,unc (AlwloReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_rkP); R | ^O /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cppT:7:O1: _note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here S 7I | DMEFIPNE_nLcclEDev,Fun c(A2llR)edu ce_ TRE| E_^SIM PLE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h_PreMulSum_fIn file included from 32_2, ncclFuncAllReduce,: 611:62F: note: uexpanded from macro 'DEFINE_ncclDevFunc' n611 | c RPunWrorkeBatMch,G aOwarning: lgo_initializer order does not match the declaration order [-Wreorder-ctor], T prR EotoE,670 u, | nr oIn file included from N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group C C tLill_d>P(().RtruOinTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RundW), onthrreakdBs(nathrteadcs),h t, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 670, | t id(| tid ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~), nth rea| ds( tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_nt hre ads), 671tid | InB loc k(t hre adIdsx.xt), egcoprllSo, uitypz, (ere(gdosrp,u alpgo,) pr,oto , u nro| ll> ^~~~~~~~~~~~~~~~~(). run/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(); \ : | ^670 te:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h60pSiz:e :670_note: :=field 'group' will be initialized after field 'stepSize'=15 :0 ? nnote: 670cfield 'nthreads' will be initialized after field 'tidInBlock'c | l S h m tid(ti670 | d tid(emt.coimm.dbuf)fSi,zes [N)n,Ct CnhtLhrr_eaePdsaR(nds(nthreads), tidInBOTOl_SoIMPcLE]k/NC(CL_tSTEPhS/srizeeof(atThdr)eIad ds):x, .tisxdIt)nBl,eoc pk(gSthrireozadueIdp_x.()x) g, {rgr oo uu| p(p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gr) o, up) | , group(group | | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(:254:t90: inote: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here d 254) | , Prinmittivehs, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNRlOock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclFuncAllReduce, FuncP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBIn file included from lock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBd%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~loc k(th readId| x.x), group(groupgroup( group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h 671 | stepSize(step:Size_ 503== 0 ? :ncclSh9mem.: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-cnomm.butffSizesh[NCCL_rPROTO_SeIMPLE]a/NCCL_dSTEPS/siszeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitivesup, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(Op,t FainAsymmetdric, /*hDirrect=*/e0, Paroto,d 0> prsims | , ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: 565:work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here432 565 | | runT reeUpD own< if (tid < subtn) RuT, RedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432nWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f32_2, ncclF:78: note: uin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | n cif (tiAd < sulbtn) RlunWorRkCollC().CrunL(ti_d, AsubtLn, GworOk);_ T | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize:_670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] =670 | = tid( tid),0 nthr eads?(nthr eads)n, tidcInBlocck(thlreadSIdx.xh), grmoup(geroup)m, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. | c tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671o | stmepSimze(st.epSizbe_ ==u ffSiz0 e? nccs[NCCL_PROTO_SIMPlShLmem.cEomm.b]uffSi/zesNCCL_STEPS/sizeof([NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSizeT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 1 254 | , P rimit1ives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | L L_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n if (tid < subtn) RunWorkthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Coll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hAllReduce_TRE:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSInBulocmk(t,hrea dIfdx.lx),o graoupt(gro,up NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchcc,lShm em.acomlm.bguffoSiz,es[ NCCpL_PrROToO_StIMPoLE],/NC CL_uSTEnPS/rsiozeofl(T)l : >ste(pSi)ze.run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),dOp , Fan| Sy ^~~~~~~~~~~~~~~~~mm etric/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h<1>,: 0670, :Pro60to,: 0> prnote: imsfield 'group' will be initialized after field 'stepSize' | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :558:5:670 note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here | 558 | run Ringt(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COL L_UNROLL>(tid, n threads , work);t | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hi:432:78d: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here (432 | t if (tidi d), nthreads(nthreads), t< subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) IN{E_ncc lDevF unc(Al| lRedu ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ce_TR EE_SI MPLE_| PreMu group(grouplSum _f32_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h2, ncclFuncAllRedu:ce, F303uncPr:eMulSu90m, f: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, algo, proto, unredOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78:o ll>(note: ).ruin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested heren( ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h432:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthre | a d s if) (t,id < tsubitn)d RuInWonrkCBolll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce ^~~~~~~~~~~ _TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, Func/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)orkColl().run(tnthreads), tidInBlockid, subtn(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWo, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_rkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,SIMPLE_PreMulSum_f32_2, ncclFuncAllR unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ educe, FuncPreMulSum, float, ,N group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn)eadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkC /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.holl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO if (tid < subtn) RunWorkColl().run(tid, subtn, work_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h[:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInncAllBReduce, lFuncPreoMulSum,c float, kNCCL_(ALGO_TRtEE, NCCLh_PROTO_rSIMPLE,e 4) | ^ a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62:d note: expanded from macro 'DEFINE_ncclDevFunc' 611 | I RundWorkBatcxh,) algo, p,roto, u group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nt 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : hsreadst), tiedInBlpock(thSreadIidx.x)z, grouep(gro_up), ) | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:60: {note: field 'group' will be initialized after field 'stepSize' 670 | tid(| tid), ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: _TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_L/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nL128_PreMulSum_f32_2threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | : tid(670tid), nt:hreads(nt15hreads),: tidInBloc k(threadIwarning: dx.x), ginitializer order does not match the declaration order [-Wreorder-ctor]roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffS 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_)izes[NC CL_PRO{TO_SIMP LE]/N CCL_S| TEPS/s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~izeof(T ) : st epSize_| ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown(tidl, nthereads<, work1); ,| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432:78:1 note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here , 432 | iCf (tiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | Od < sLubtn)L RunW_orkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here nU, T,N RedOpR, AlgoO, ProtLo, COLLL_UN>ROLL>(,).run (tid,C subtOn,LL_UNROL work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_nc 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tidclDevFunc(AllReduce_RINL>(tid, nthreads, work)), nthreads(nthreads), tidInBlock(threadIdx.x), group(g; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here G_SIMPLE17_PreM | ulSumD_f32_E4, ncFclFunIcAllReNduce,E FuncP_reMulnSum, cfloatc, NCClL_ALGOD_RINeG, NCvCL_PRFOTO_SIuMPLE,n 4) c| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h(:611:62A: note: expanded from macro 'DEFINE_ncclDevFunc' l 611 | l RRunWorekBatcdh, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:IMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); 670:\15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | | tid(t ^id), nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.heads(nthreads),: tidI670nBloc:k(thr15eadId:x.x), grounote: p(grofield 'nthreads' will be initialized after field 'tidInBlock'up), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | tives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().ruRITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCC NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, ntL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | teadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.mcem.coomm.bumffSizmes[.NCCLb_PROTOu_SIMPfLE]/NfCCL_SSTEPS/isizeozf(T) e: stepsSize_[) { N| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | C group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hC:63:56:L note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: 6363 | : Prim56itives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); ro| to, ^COL L_UN/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hROLL>(tid,: nth432read:s, w78ork:); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hnote: :432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herein instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if 432(tid | < s ubtn ) R unWo rkCo ll(). run(tsid,ubtn) RunWorkColl subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllRe().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPrducee_RMING_SIMuPLEl_PrSeMuulSumm_,f32 _4,f nclclFouncaAlltRed,uce, FuNncPCreMCulSLum,_ flAoat,L NCGCL_OALG_O_RRINGI, NNCCLG_PR,OTO _SIMPLE,N 4)C | C^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hL:611_:62:P note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrollR>OT(O_)SI.MPLrE,u 4n) ( | )^ ;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :611\:62 : note: expanded from macro 'DEFINE_ncclDevFunc'| ^611 | RunWorkBa /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ht:c670:h15:< coll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidnote: field 'nthreads' will be initialized after field 'tidInBlock'I n670B | l o tcidk(t(idt),h nrthreeaadsd(nItdxhr.eaxds)), ,ti dIgnBrloocku(tphr(eagdIrdoxu.xp), )gr,ou p (gr| ou ^~~~~~~~~~~~~~~~~p) , /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^~~~~~~~~~~~~~~~~ :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h670:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreocads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hMulSum, float,:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] N670 | Ctid(tiCd), nthLr_ALGeads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stO_eRING, pNCCL_PSROTO_SiIMPLE, z4) | ^e /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611_:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | = RunW=orkBatc h, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid( ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, floatncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hck(thr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),eadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncAllReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx906. 1818 warning warnings generated when compiling for gfx1200. s generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y,In file included from head, ma/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:issa18: warning: ;unused variable 'y' [-Wunused-variable]In file included from | 77/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp ^:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12 | : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from uint32/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18_: warning: unused variable 'y' [-Wunused-variable] 77 | t u int32_t y,y head, , head, mantissa; | ^ mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable]by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrieIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ x.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: In file included from unused variable 'w' [-Wunused-variable] 75 | In file included from barri/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2e: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174r: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable]_ 145 | uibnt32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7 | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145::21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | bar32_t darta1, flag1,ier_by_group(); data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]y_g roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :174/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:: In file included from 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hwarning: unused variable 'w' [-Wunused-variable] :75 | 174 barrie: r_by_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | con/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | ba barrrier_byr_group(); | ^~~~~~~~~~~~~~~~~~ i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: enote: expanded from macro 'barrier_by_group' 29 | r const_ int w b= threaydIdx.x/W_ARP_SIZgE; \ | r ^ In file included from oup()75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from st int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, fl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cppag1, :data22, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from uint32_t dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ a1, flag1, dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: aIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:112: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: ,warning: unused variable 'data1' [-Wunused-variable] 145 | f uintl32_t daata1,g flag21, dat;a2, fl ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h| :145 ^~~~~:21: warning: unused variable 'flag1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h145 | : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::174: i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145145:14n:: warning: unused variable 'data1' [-Wunused-variable]t 28 1453 | : 2 uin_twarning: 32_ttunused variable 'data2' [-Wunused-variable] d at a1d, aflag1t145, d | aat a2, f l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cppag2 :1 uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->ch, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:; 145| ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fla/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145g:35: warning: 2unused variable 'flag2' [-Wunused-variable] 145; | | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: uint32_t data1, flag1, data2, flag2; | ^~~~~ annelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();:175 | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h: | b80arrie:r_by_5group:(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hwarning: :29:15:unused variable 'w' [-Wunused-variable] note: expanded from macro 'barrier_by_group' 29 | const 80int | w = t hreadI dx.x/ WARP_ SIZE;b \ | a ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP note: _expanded from macro 'barrier_by_group' 29S | cIonstZ int Ew = t;hread Idx.x/\WARP_S IZE; \ | | ^ ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_tIn file included from * ptr = rec ui/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt64_t* pvt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpprP = r:tecvP2rtr(: 0()+lIn file included from l0128/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hOf)fs:et+; 11 | l ^~~: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19l128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from : warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = In file included from ncclShmem.channelId - work->channelLo;In file included from | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channe/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2l: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:I15: warning: unused variable 'bid' [-Wunused-variable]d 27 | co-nst i nt biwd = nocclShrmem.ckhanne-lId -> workc->chanhnelLao; | ^~~ nnelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId -/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h work->channelLo; | ^~~:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | conIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: st int unused variable 'w' [-Wunused-variable] 75 | barrier_by_grbid oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = ncclShmem.channelId - work->cIn file included from hannelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ - work-/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ >channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 218 | conIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, st int bid f=lag1, data2, flag2; | ^~~~~ ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_n | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadcclDeIvFunc(dAllRedxuce_TRE.E_SIMPxLE_Pre/MulSumW_f64_2,A ncclFRuncAllRPeduce,_ FuncPSreMuIlSum, ZdoublEe, NCCL)_ALGO,_TREE, NCCL _PRO| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), TO_SI MPLE, | 2) | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611 :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | | Run warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3WorkBa tch, algo, proto, epSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | unr oll>(p).run(r); \ i | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hm:670:15:s note: field 'nthreads' will be initialized after field 'tidInBlock' 670( | ttid(tidi), nthdreads(n-threadns), titdInBlohck(threradIdx.ex), graoup(grdoup), s| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:S670plit, nthreads-nthreadsSplit, &tree->up, t:60:r note: field 'group' will be initialized after field 'stepSize' 670e | teid(ti-d), nth>readsd(nthreoads), tidIwn, work->sendbuffnBlock(threadIdx.x), group(grou, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().ruIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:n(tid, subtn, w15: warning: initializer order does not match the declaration order [-Wreorder-ctor] o 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=In file included from */0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: ;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid| (tid), ^ nth re/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hads(nthreads), t:idInB432lock(threadIdx:.x)78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl<, Fgroupn(grou,p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | T tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | , stepSRize(setepSidze_ =O= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/p, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TRsizeoEf(T) E: st,epSi ze_) N{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C | group(group C/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254L:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254_ | P PrimRitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthe<1,r 1, CeOLL_UNaROLL>d, COLsL_UNR(OLL>(tnid, ntthreadhs, worrk)e; | ^a /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:d432:78: snote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here )432 | , if (tid tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ < subtn) RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.horkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp note: field 'group' will be initialized after field 'stepSize': 7670 | : 1tid:(ti d)note: , nin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested herethr ead s(nthre7ads | )D, tEidIFnBlIockN(tEhreadIdx.x), group(group), | ^~~~~~~~~~~ _ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_eUads), tidNInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:r11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupeads(nthreads), tidInIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>() /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm..run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nFuntcAllReduhce, FuncrPreMulSuem, rccl_faloat8, NdCCL_ALGOs_TREE, N(nthreads), tidInBlock(thrCCL_PReOTO_SIMaPLE, 2d) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hI:611:62:d note: expanded from macro 'DEFINE_ncclDevFunc' x611 | . RunWoxrkBatch), arloup(go, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid),group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: L>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, doublefield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().ruIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthds), tidInreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==Block(th 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group readIdx.x)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) pSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP S| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work-> | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, rsecvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunsW), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizetshreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock( t| ^ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl():565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollnote: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here (12 | D)EFIN.E_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h Run:Work670Batc:h, alg o, proto670, un | roll >(). run( ); \ | ^t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:i670:15: dnote: field 'nthreads' will be initialized after field 'tidInBlock' (tid), nthreads 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:(nth readnote: s), tfield 'group' will be initialized after field 'stepSize'idIn Blo ck(threa670dIdx | .x), gro up(gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: 15note: : warning: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 670 | ti7d(t | id)D, nEthrFeadIs(nthreads), tidInBlock(threadIdx.xNE_)ncc,lD evFguroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCnIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthrCe, FLunc_PreMulSSum,T doEubPle,S NC/CL_sALGiO_TzREEe, NoCCLf_PR(OTOT_SI)MPL E,: stepSize_) 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proteads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> primso , unr| oll ^>( ).ru/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn(); \: | 565 ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h::670:515:: note: field 'nthreads' will be initialized after field 'tidInBlock' note: 670 | in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here ti d(tid565), | nth rea ds (nt hreards)u, ntidTInBrloeck(ethrUepDown, COLL_adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groUNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subutp(grnoup,) , w| ^~~~~~~~~~~ ork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads) tid, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here , | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_ 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (t:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreadsi)d < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthread, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : ss(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIM:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_S tid(tid), nthreads(nItMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stPLE_PreMulSum_f8_4, ncclFuncAllReduce, Fu:n note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllnote: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | PrimReduce,i FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdtives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | RedOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here :670:15: Primitives, /*Direct=*/0, Proto, 0> prims | ^ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(th/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ readIdx.x), group(group), x.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBOp, Algo, Proto, COLLatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthr_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12eads),: tidInBlock(threadIdx.x), group(grou1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunpc), | ^~~~~~~~~~~ (AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ), tidI nBlo ck(t hreasdIdxt.x), egroupp(groSup),i | ^~~~~~~~~~~~~~~~~z /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:e670:60: (note: field 'group' will be initialized after field 'stepSize's 670 | t etid(ptid),S nithrezads(enthr_eads ), t=idIn=Bloc k(th0read ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPSIdx./x), gsroupi(grozup),e | ^~~~~~~~~~~ of(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.headIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo:c670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, doub/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hle, NCCL_ALGO_RING, NCC:670:15L: warning: initializer order does not match the declaration order [-Wreorder-ctor] _ 670P | tRid(tOid),T O_SIMPnthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlepSioze_ c== 0k ? n(cclSthmemh.comrm.bueffSiazes[dNCCLI_PRdOTO_SxIMP.LE]/xNCCL)_STE,PS/s izeogf(T)r : sotepSiuze_p) { (| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ g| roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | r 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadsroup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670: | tid (tid), ntshreads(ntthreads), etidIpnSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | Block(t group(grouphreadIdx. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitivesx), gro, 0, ProtoepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | iftep Size_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ht:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herei 303 | d Primi tives,d /*Op, Algo, Proto, Direct=*C/0, ProtOo, LL_U0> primNROLLs | ^>(). /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:run(tid565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllRReOLL>,d COLLu_UNROLcL>(tied, nthr,eads, work)F; | ^u /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:n432:78: note: cin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here P432 | r eMulSum, dou b if (ltid ().ru611n(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDev | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tFuinc(AldlRed)uce_,T nthreads(nthreadREEs_SIM)PLE,_PreM ulStum_f8i_4, dncclIFuncnAllRBeducle, FouncPrceMulkSum,( rcctl_flohareadIdx.x), grto8, NuCCL_pALGO_(TREEg, Nroup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hCCL_PR:OT670:60: note: field 'group' will be initialized after field 'stepSize'O_ SI MPLE, 6704) | | ^ tid(tid), nth /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>(reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nth 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlockreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ (threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, do/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uble, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ht:670:i15: warning: initializer order does not match the declaration order [-Wreorder-ctor] d 670 | ( tid(ttiid), nthreads(nthreads)d,), nth reads(ntthreadis), tiddInInBlock(threadIBldx.x), group(group), oc| k(thr ^~~~~~~~~~~eadIdx .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),eadIdx .x), ggNE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(grroup), | o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ u 671 | p step(Size(gstepSirze_ ==o 0 ? ncuclShmepm.comm).buf, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | sftSizes[eNCCL_PpROTO_SSIMPLE]ize(stepSize_ == 0 ? nccl/NCCSL_STEhPS/sizmeof(T)e : stepm.comSmize_).buffS {i | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ zes[NCCL_PROTO_SIMPLE]/N | C group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hC:254:90: note: Lin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | _ PSriTEmPitS/sizeof(T) :ives< T, RedOp, stFaepnAsSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | rymmuetricg, /*Dire, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ */0T, Proto,, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :565:5:R note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | e rudnTreeUOpDownp, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | Op, P rotoSProto, implCOLL_UNROLL>(tid, e<1, n1, COtLL_UNhROLL>,r COLLe_UNROaLL>(tdid, nsthrea,ds/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , w owork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:r78k); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432::78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herenote: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 432 | if (tid < subtn) RunWorkColl if( (tid) < su.btn) rRunWourkColnl, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, A:lgo, 22Proto:, CO1LL_U:NROLL >().rnote: un(tidin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(All, suRbteduce_RING_SIMnP, woLrk)E_P; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:r17:1: enote: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here M17 | DuEFINlE_ncSclDeuvFunmc(Al_lRedfuce_8TREE__SIM4PLE_,PreM ulSunm_f6cclFuncAll4R_4, enccldFuncuceAllRe, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:duce62, :Fun cPrenote: MulSexpanded from macro 'DEFINE_ncclDevFunc'um, do uble, N611CCL_A | LGO_ TREE , NCC L_P ROTOR_SIMPuLE, n4) W| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:611:r62: note: kexpanded from macro 'DEFINE_ncclDevFunc' B611 | a RunWtorkBcath, algo, procth().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:re670dop:15, al:go, protnote: o, ufield 'nthreads' will be initialized after field 'tidInBlock'nrol l>() .run(); 670\ | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670: 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(ttid(tid)id),, nt hr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | nthreadeads(nths(nthrea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ds), tidInBlo 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670re | ad s), tid InB lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nttid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(hgreards),o tiduInBlopck(t)hre,adId x.x) , gr| oup(group ^~~~~~~~~~~ ), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ht=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runT:670r:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]e 670 | e tiUd(tid)p, Down, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, suboup), t | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | n tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | , st epSizew(stepSoize_ =r= 0 ? nkcclShme)m.com;m.buffS izes [NCCL_| PROTO_ ^SIMPLE ]/NCCL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp_STEPS/sizeof(T): : ste17pSize_:) { | 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254: 90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here note: 254 | in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here Primit ives<17 | DEFINE_ncclDevT, RedOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),In file included from tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ = 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBloPcrimitivkes, /a*Direct=*d/0, Proto,I 0> prims d | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hx:.x/WARP_SIZE)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro565,:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE | runTree Up508 | flaDown, COLL_UNROLL>(tid, nthr3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSiup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] F670 | tida(tid), nnthreads(nSthreadys), tidmInBlockm(threadIedx.x), gtroup(grourp),ic<1>, 0, Prot | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | o tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | , st epSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPeSads, w/ork); s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hi:432:z78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here e 432 | o if (ftid < (subtn)T RunWo)rkColl ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Pr0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: pSize_) {in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:43290: note: | in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Pri miti ves< T, RediOp, fFanAs ymme(trict,s /*Duirectb=*/etMu0nlSum),_f8 _PR2, runcconlFutWncAoollR,redu kce,0C Fu>oncP lreMplulSr, ProtoSimple<1, 1, 4>, 4>' requested here| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hP : r611:62:o note: 565expanded from macro 'DEFINE_ncclDevFunc' t | 611 | o , Ru n WoCr kBOartcLhu().run(treeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: id, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double,_UNR OLL>N, COCLL_UCNROLLL>(_tid,A nthLreadsG, woOrk); _ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hR:432I:78: Nnote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here G432 | , if N(tidC < sCubtnL) Ru_nWorPRfield 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_LL128, 2) kColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(Al | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ lR ed| uc ^e_ TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduc warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIneB_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | _UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]T/REE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < 12 | DEFINE_snubtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cctid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | c<1, NC CL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrea tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grods(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here :670 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t303id(tid) | , nthrea ds(nthr eads), t idInBlo ck(thre adIdx. x), grouPp(groupr), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ imitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:ROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0:,670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runT78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchOLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllRedurceeUepDown_().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IUNROLLM>, PCOLL_LUNROELL>(_tid, Pnthrerads, weork);M | ^ u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hl:432:78:S note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here u432 | m i_f (tf64_4, nid < subtn) RunWorkColledu(ce,) Fu.ncPrreMuulSnum, (doutblie, NdCCL,_AL GsO_uTREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, btnu, wnorkr); o | ^ l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cppl:17>:1:( note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here ) 17 | DEFI.NE_rncculDenvFu(nc(A)llRe;duc e_T\REE _SI MPL| E_P ^reM ulS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hum_f64_4, :ncc670lFu:ncA15llR:edu ce,note: Fufield 'nthreads' will be initialized after field 'tidInBlock'ncP reM ulSum, 670dou | bl e, NCCL _A tid(tid),LGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch< nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreacoldl, sty,) re,dop t, ailgod, pIrotno, Bunrlollo>()c.runk()(; \t | h ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hr:e670:15a: note: field 'nthreads' will be initialized after field 'tidInBlock' 670dIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives,Block(threadIdx.x), gr 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.houp(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCC(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(L_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hw:o670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] r 670 | tkid(tid),) nthreads;(nthreads ), tidInB lock(th| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevreFadIdx.ux), gronup(grocup), | ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | A tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | l stepSlize(stRepSizee_ ==d 0 ? ncuclShmecm.come_RING_SIMPLE_PreMum.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeoflSum_f8_2, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().r{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, CO/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hLL_UNROLL>(t:id, 670nthr:eads60, wo:rk); | ^note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hfield 'group' will be initialized after field 'stepSize':432:78 : note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | 670 if | (ti d < subt n) Ru nWortkid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), Collg()p.run()tid, ,subtn , w ork| ); | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp :17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(:670:15t: warning: initializer order does not match the declaration order [-Wreorder-ctor]h r670 | e taid(tdid),I nthdreadxs(nt.hreaxds),) tidI,nBlo ck(thgreadrIdx.ox), ugroupp(gr(oup),g | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~roup), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthread runs), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMuly, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: Sum, field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ doubl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size, NCC, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | L_ALGO_RI tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NG, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreeof(aT) d: stepSisze_)) {, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | t group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hi:63:56:d note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested hereI 63n | B Prlimiotivecs, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_R/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ I=dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ = 0 ? ncclShmem.comm.buffSPS/siizeof(T) z: stepSieze_) { | s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h[:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here N 303 | C PrimitiCves, /*DOirect=*/_0, ProtoS, 0> priIms | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hM:565PLE]/NCCL_STEPS/sizeo:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, f(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] FINE_ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Run/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(ntWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreahdreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threas(nthreads), tidInBlock(threadIdx.x), group(grdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.horkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NC:670:15:C warning: initializer order does not match the declaration order [-Wreorder-ctor] 670L | tid(tid), nthreads(nthreads)_, tidInBAlock(thLreadIdxG.x), grOoup(g_roup), T | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_R 671 | E sE, NCCL_PROTO_SIMPLE, 4) | ^tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h(:611:62T: note: expanded from macro 'DEFINE_ncclDevFunc' )611 | Run:Work Batchs,S algo,i proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:ze_) 60{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :254:90: note: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | field 'group' will be initialized after field 'stepSize' Primitiv es, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, Cd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < s 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ubtn) Ru/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),nWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreM ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiz/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | NROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllRe ^~~~~~~~~~~d uce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run();/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, algo, prot:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0o, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPL 670 | tid(tid), nthreads(nthreads), tiE, 4) d | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre a| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccl 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCRL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffS(tid), nthreadsizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncAllReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PR(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncAllReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx90a. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:1514: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: : warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:1877 | : warning: unused variable 'y' [-Wunused-variable] uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: hreadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from o/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175s: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:t warning: unused variable 'ptr' [-Wunused-variable] 271 | i uint6n4_t* pt w = threadIdx.x/WARP_SIZE; \ | ^ tr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 29 | const int w = tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hhread:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | Id x.x/ WARP_S IZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uin/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:chann:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ elLo;11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrie | ^~~ r_by_group(); | ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | cons /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); :| 29 ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | t int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | co flag1, data2, flag2; | ^~~~~ nst int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3In file included from 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271In file included from :19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group()bid = nc; | ^~~~~~~~~~~~~~~~~~c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lShmem.channelId:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:elId - w15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ork->channelLo; | ^~~ - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tidIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, Fun, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algcPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hlShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives stepS,ize(st e/*Direct=*/0, ProtpSize_o == 0 ? ,ncclShm 0>em. prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hcomm.buffSizes[:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | NCCLr_PROuTO_SIMnPLE]/NTreeUpCCL_Down, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here i303f | Pri(mitivtes,) /*Di rectRunWorkCol=l* p,r RedOp, Algo, Proto, COLL_imsU | N ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:R565:5:O note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | L L>().run(tid, subtn, wor rkunT)reeUp;Down, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ L_UN/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hROLL>, COL:L_U611NROLL:>(tid62, nth:reads , wonote: rk); expanded from macro 'DEFINE_ncclDevFunc'| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432:78 : note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | 611 | if ( tid < subt n) Ru nWorkRCollu, aeldOpg, Aol,go, Prpoto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDeroto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), vF unc| (Al ^~~~~~~~~~~~~~~~~lRe duc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he_TREE:_SI670MPL:E_P60r:eMu lSunote: m_ufield 'group' will be initialized after field 'stepSize'32_2 , n cclFun670cAl | lRe duc e, F unc PretMuliSumd, u(intt32_ti, NdCCL)_AL,GO_ TREnE, tNCChL_PrROTeO_SaIMPdLE,s 2) ( | ^n /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ht:611:h62: rnote: expanded from macro 'DEFINE_ncclDevFunc' e 611 | a d RusnWo)rkB,atc h, algo, proto, unroll>().r | ^~~~~~~~~~~ un(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ omm./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: buffSizIn file included from eimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s[NCCL_P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hROTO_SIM:PLE]/NCC11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:L_STEPS/sizeof(T) : steIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccpSizle_) { S| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hh:63:56m: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63e | Primitivesc, 0, Protoo, 0> pmm.buffSizes[NCCL_PROTO_SIMPLE]/Nrims | C ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:C5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here L558 | r_unRingS(tid, n/threads,s work);i zeof(T) : s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> pri | imf (tid s< s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulS, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | um, uint3 2_t, N CCL_A LGO_R ING, NCCL_ PROTOi_SIMPfLE, 2 ) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:(611:62:t note: expanded from macro 'DEFINE_ncclDevFunc' i611d < subtn) RunWorkCollL, a_lgoU, pNroRto,O unLrolLl>()>.ru(n()); \. | r ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hun:670(:15:t note: field 'nthreads' will be initialized after field 'tidInBlock'i d670 | , tidsubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:(tid), nthreads(nthreads), tidInBl1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFocukn(thcreaAdIdlx.xl), Rgrouep(gdrouup), c| ^~~~~~~~~~~~~~~~~e /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h,:670: 60: Fnote: ufield 'group' will be initialized after field 'stepSize' n670 | c Ptid(tid),r ntehreMadsu(ntlhreSads)u, tmid, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSi z e(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, wortid(tid), nthreads(nthreads), wid(tid%WARP_SIZEk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFunc), warp(tid/WARP_SIZE), AllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro| ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173tid(tid): , nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hs(nthreads:), tidInBl670ock(threa:dIdx.x), 15group(gr:oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ warning: 671 | steinitializer order does not match the declaration order [-Wreorder-ctor]pSize(ste pSize_ 670 | tid(tid), nthreads(nthreads == 0 )? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo,c tidInBklock(thre(adIdx.xt), group(ghroup), | r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671e | stepaSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_U | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ N671 | R sOtepSizLe(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS / 254 | s i Primiztivese,s /*Ditrect=e*/0, pProto,S 0> piriL>mz(tidse, nt h_rea )ds, w | ork) ^{; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h| :432:78: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, 0, 2, 2>::run' requested here L 432 | L >, COLL_UNROLL>(tid, nthreif (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncRedcOp,l FanDAsyemmevtriFc,R /*eDdireuct=c*/0e, P_roto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 0, 2, 2>::run' requested here1 ,432 | 1 , if (tCid O< sLubtLn) _RunUWorNkCoRllOp,, Al go,C PrOotoL, CLOLL__UNUROLTNREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROLL>(tid, nthreads, worLk>()).;ru n( t| id ^, su/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hbtn, wor:k)432; : | 78 ^ :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp :7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevnote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereF u432 | n c ( Aifl (ltReduce_TREE_SIMPLE_PreMulSum_id < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSumu32__2u, 3nc2cl_Fu4nc,Al lRnedcucce,l FFuunncPcreAMulllSRume, duiuntc32e_t,, NCFCL_uALnGOc_TPREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | reMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611ti:d(62ti:d) , note: ntexpanded from macro 'DEFINE_ncclDevFunc'hr ea ds(n611th | read s) , tiRdIunBnloWcko(trhrkeaBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidIdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hup(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hdInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.com:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Prim:670:15i: warning: initializer order does not match the declaration order [-Wreorder-ctor] t670 | i tid(tidv), nthreeads(nthreads), tidInsBlock(t, 0, Proto, 0> prims | ^= 0 ? ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hlShmem.comm.buffS:izes[N558CCL_PR:O5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(ti)d, nthr,eads, wor k); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h| :432:78: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (| tid < tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_subtn) Ru nWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFSIize_N) { E | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ _| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hnc:254c:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | lDePvFrunc(iAllRmeducei_RINGt_SIiMPLEv_PreeMLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s, /*Direct=*/0, Proto, 0> primsLGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, al | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCgoo, plrotol, un().,run ();T \ | ^ ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' R670e | d tOid(ptid), nth,rea ds(Anthlreagds)o, ti,dIn BloPckr(thoreadtIdxo.x),, g rouCp(gOrouLp), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | .x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hgrou:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here X_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here303 | Pr imitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNRO iLf (tLid <> sub(tn) tRunWiorkCodll().drun(tsid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u, w3ork)2; | _ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:478: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here, 432 | n icf (tcid ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u3m, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock2_(2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algthroea,dI dx.x), pgrrouop(gtrouop),, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hu:670n:60r: note: ofield 'group' will be initialized after field 'stepSize' l670 | l >().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Pr); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreeads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(:670:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid:), nthreads(nthreads),670 tidInBlo:ck(thrteid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ a15d: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | Idx. x), group ( tigroupd(tid)),, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ nthreads(nthre | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | a steds), tidInBlockp(Size(stepthreadIdx.x), gSizer_ == 0 ? onccup(group), lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) {s tepSiz e_) { | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | P| rimitiv group(groupes, 0, Proto,:303:90: note: 0> pin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hererim 303 | Primitives, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown < Tif (ti,d < s ubtn)R RunWeorkCodll COLL,_UNRO LL>COLL_UNROLL>(tid, nthreads, work()).ru;n(ti d, s ubtn| , wo ^rk); | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here : 432:78: note: 22 | Din instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereEFIN 432 | if (tid < subtn) RunWE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FunorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: cPin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested herere MulSu m, uint3217_t, | NCCLD_ALGEO_RIFNG,I NCCNL_PROETO_SI_MPLncclDevFunc(AllReduce_TREE_SIME,PLE 4_) P| ^ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.he:611:M62: unote: expanded from macro 'DEFINE_ncclDevFunc' l 611S | u Rumn_WorukBa3tch2,, al go, prNoto,C uCnroLll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthr_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,e adsa(ntlhregadso), ,tid InBplocrk(tohretaodIdx,.x) , gurounp(grrouop),l | l ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h>:670:(60: )note: .run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' field 'group' will be initialized after field 'stepSize' 670670 | | t id (ti d), ntthrieadds(n(thrteadis),d ti), nthreads(ntdhInBrlocke(thareaddIdsx.x)), , tgiroudInBlock(threadIdx.xp(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here MPLE]/NCCL_STEP 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncAllReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1200. 22 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx90a. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp :flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nt h670r | e a ds ( nttihdr(etaidsd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrohurpe)a,d I d| x ^~~~~~~~~~~. x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0 , 670P | r ot o , t i0>d( tpiridm)s, n| t ^h reads(n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ht:h565r:e5a:d snote: )in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here, tidIn B565lo | ck ( thrruenTardeIedUxp.Dxo)w,n i,z eC(OsLtLe_pUSNiRzOeL_L >=(=t i0d ,? nntchrcelaShdmse, mw.ocrokm)m;. b u| f ^f Sizes[NCCL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hP:RO432:T78O:_ Snote: Iin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereM PLE]/NC C432L | _S T E PS / siifz e(otfi(dT )< :s usbttenp)S iRzuen_W)o rk{ C o| ll ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~< F | n group(group, T, RedOp, Algo, P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hr:o254t:o90,: Cnote: Oin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereL L_UNROLL >254( | ) . ru n ( tPirdim,i tsuibvtens,< Tw,o rRke)d;O p ,| ^F anAsymmet/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hwarning: :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | initializer order does not match the declaration order [-Wreorder-ctor] ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here670 | ti d432( | t i d ) , nitfh r(etaidds <( nstuhbrtena)d sR)u,n WtoirdkICnoBlllo ().run(t id671, | s u b tsnt,e pwSoirzke)(;s t e| p ^S ize_ == 0 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp?: 22n:c1c:l Snote: hin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested herem em.comm.b u22f | fDSEiFzIeNsE[_NnCcCcLl_DePvRFOuTnOc_(SAIlMlPRLeEd]u/cNeC_CRLI_NSG_TESPISM/PsLEi_zPeroefM(uTl)S u:m _sut6e4p_S4i,z nec_c)l F{ u n| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A l l| Re group(groupd uce, FuncPreMulSum, uint64_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ht:, 303N:C90C:L _note: Ain instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereL GO_RING, N303C | C L _ P R OPTrOi_mSiItMiPLvEe,s <,c o/l*lD,i rteyc,t =r*e/d0o,p ,t oa,l g0o>, pprriomtso , | u ^n roll>()./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hr:u565n:(5):; note: \in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here | ^ 565 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :r670u:n15T:r enote: efield 'nthreads' will be initialized after field 'tidInBlock'U pDown< T670, | R e d O pt,i dP(rtoitdo),S inmtphlreeI,n BClOoLcLk_(UtNhRrOeLaLd>I(dtxi.dx,) ,n tghrroeuapd(sg,r owuopr)k,) ; | ^~~~~~~~~~~~~~~~~| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432:78: note: 670in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here | tid (432t | i d) , n t hirfe a(dtsi(dn t().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthread_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, al go506, | p r o ttoi,d (utnirdo)l, ln>t(h)r.eraudns(()n;t \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' h r670e | a d s )t,i dw(itdi(dt)i,d %nWtAhRrPe_adSsI(ZnEt)h,r ewaadrsp)(,t itdi/dWIAnRBPl_oScIkZ(Et)h,r e a| d ~~~~~~~~~~~~~~~~~~Id x | .x stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) , grou p507( | g r o u pw)a,r p I| n ^~~~~~~~~~~~~~~~~B loc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hk:(670t:60h:re anote: dfield 'group' will be initialized after field 'stepSize'I dx.x/ W670A | R P _ S ItZiEd)(,t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t warp(tid/WARP_SIZEh reads (508n | t h r e afdlsa)g,T htriedaIdn(B(ltoicdk%(4t)h=r=e3a)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~p ) ,| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 | ^~~~~~~~~~~ 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h18 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().r u670n | ( ); \ t i| d ^( tid), n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ht:hr670:e15a:d snote: (field 'nthreads' will be initialized after field 'tidInBlock'nt hreads )670, | t i d ItniBdl(otcik(dt),h rentahdreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncAllReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work-/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ >channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29In file included from :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->dE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ own, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : sIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(th/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthhreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_premulsum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncAllReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RIIn file included from NG_SIMPLE_Prod_bf16_2, ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpplFuncAll:Reduce,2 FuncPr: od, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]: 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrea ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hgroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hlock(:670:15: warning: threadIdx.x),initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | : note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_2, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives,a 0, Prodto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TRE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | PrimitiE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ves, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf16_4, ncclFuncAllReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx90a. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ork->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h::11218:15: warning: unused variable 'bid' [-Wunused-variable] 218: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | const int bid = ncIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ clShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const iIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hnt bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ duce_RI 671 | N steGpSize(s_tepSizeS_ == 0 ?I ncclShMmem.comPm.buffSizLes[NCCLE_PROTO__SIMPLE]P/NCCL_STEPS/sizeorf(T) od_bf: st8epSiz_2, ncclFuncAllReduce, FuncProde_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ c, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t, 1>i, /*Ddirec(t=*/0t, Protio, 0d> pri)ms | , ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :565:5: nnote: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here t565 | h rurnTreeeUpDowna, CaOLL_UdNROLLs>(tid), nt, threads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (idInBtlock(ithrdeadIdx .x), , FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~ kColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLLd), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ CL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLtid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hbfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' :670670:15: | warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti d(tid) , nthre ads(nthtreads), itidIndBlock(th(readIdxt.x), groiup(groudp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ ,671 | stepSizen(stepSitze_ == h0 ? ncclrShmem.ecomm.buaffSizes[dNCs(nthCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UN R Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthread(grs), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tidoup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(ti, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBl tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSi:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[zeN_) { C | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | C group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:L63:56: note: _in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here P63 | RPrimiOtives, 0M, ProtPo, 0> Lprims E | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h]:558:5:/ note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here N558 | C runRiCng(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, C 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncPOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorrokd, rCccl_obflolat8,l NCC().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tiSIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWodIrnBlkockB(thareadIdx.tx),c grhouds),, ti dInaBlolck(gthroead,Idx .x),p grroupo(grtoupo), , | ^~~~~~~~~~~ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hadIdx.x), gr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primit /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().rtuid), nnthre(ads(tid, subtn, work); | ^nt hread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpps), tidInBlock:(thre17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here adIdx.x17), | DEFINgroup(groupE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE),, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 4ste) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBapSitze(sctepShize<_ ==c 0 ?o nclclShlmem.,comm .butffSyizes,[NCC L_PRrOTeO_SIdMPLEo]/NCpCL_Sf(T), : st epSaizel_) {g | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o | group(group, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :254:90p: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herer 254o | t Proimit,ives().run(); _D\EV_A RITY , 1>| , / ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: *Difield 'nthreads' will be initialized after field 'tidInBlock'rec t=*/ 0, P670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670r:ot60o,: 0> prnote: imsfield 'group' will be initialized after field 'stepSize' | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: 670note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here | 565 | r unT reetUpDiownda, CdOLLs_UN(ROLnL>(ttidh, nrthreeaadsd,s), tid IwornkB); l | ock(threadIdx.x), group(gr ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (toup), | ^~~~~~~~~~~ id < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccadIdx.x), group(group), | ^~~~~~~~~~~ lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_bf8_2, ncclFuncAllReduce, FuncProd,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM rccl_bfloat8, NCCL_ALGO_RING, NCPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDe/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vFunc(AllReduce_RING_SIMPLE_Prod_bf8_2, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prko(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof( alTgo, )prot o, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ : step/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSize_) :{ | 670 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | : group(group 60/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h::63: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn 670 | tid(tid), nthreads(nthreads), t) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tididInBlock(threadIdx.InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_bf8_4, ncclFuncAllReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 22 warnings generated when compiling for gfx90a. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp::2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h barrier_b:218:15:y warning: unused variable 'bid' [-Wunused-variable] 218 | _ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | co n const sint bidt = nccl Shmem.cihannelInd -t w = threadIdx work->channelLo; | ^~~ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hlag2; | ^~~~~:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, - work->cdhannelLao; | ^~~ ta2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrieIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint3 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from :35: warning: unused variable 'flag2' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPt 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int \ | ^ bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uintIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int b idata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ d = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_g:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid =roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_gIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_S175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ = ncclShmem.channelId - work->chaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = rec vPtr(0)+ll128Offset; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: id)in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here, nthre ads(nth reads), tidI565nBlock(t | hreadId x.x), g roup(gro up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ r671 | sutepSizen(stepSiTze_ == 0r ? nccleShmem.ecomm.buUffSizpes[NCCLD_PROTO_oSIMPLEwn, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | g Priomitives<,T, RedO p, FanAsPymmetrirc, /*Di,rect=*/ 0, ProtoC, 0> priOms | ^ L/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:L_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDev565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreaFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ruds, nworkW); o | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hr:432k:78: Bnote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here a432 | t if (ctid , algo, proto, unroll>().run();NR OL\L>() .ru n(t| id ^, s ubt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hn, work:); 670 | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFu:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tincdAllRIednuceB, FluncoProcd, khal(f, tNCChL_ArLGOe_TRaEE, dNCICL_PdROTxO_S.IMPxLE,) 2), | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hg:611:r62: onote: expanded from macro 'DEFINE_ncclDevFunc' u611 | p ( RugnWorrkBaotchu, ^~~~~~~~~~~ a lgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tid runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); InBl| ock(th ^readId x.x), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cppgroup(group), :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,edOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown().,run() ; \ P| ^ rotoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | wOaTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInrpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSiBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' ze(nc c670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncc RunWorkBatch, algo, proto, unroll>().run(); \ | ^ lDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((ti warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { In file included from | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:: 1070:5: note: In file included from in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1070 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h runT:reeSp173lit(]tid, /nthreNads, wCork)C; | ^ L/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:_78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested hereS 432T | E if P(tid S< sub/tn) RsunWorikCollz().ru n(tids, subttn, weork); | ^ p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:5:S1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here i 5 | DzEFINEe_ncclDevFunc(AllReduce_TREE_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims _ LL1| 28_P ^rod_ f16/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h_2, ncclFuncA:llRe565duce:, F5unc:Prod , hanote: lf, in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested hereNCCL _ALG O_TREE,565 NCC | L_PR OTO_ LL12 8, 2 ) | r^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hu:611:62n: note: Texpanded from macro 'DEFINE_ncclDevFunc' r611 | e RuenWorkUBatcph, algo, proto, unroll>().run(); \ | ^ >, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiIn file included from ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl(| ).r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~un( tid , s| ubt tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_n, w ork671 | stepSize(s)t; e| ^ p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:S12:1i: note: zin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here e12_ | D == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:zeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims 611:62 : note: | expanded from macro 'DEFINE_ncclDevFunc' ^611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h RunWor:kB565atc:h, ProtoSimple<1, 1, 2>, 2>' requested heredop , alg565o, | pr oto , u nro ll>r().urunn(); T\ r| e ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670U:15:p note: Dfield 'nthreads' will be initialized after field 'tidInBlock' o670 | w n t , CtOLLi_UNdROL(L>(ttid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEidF), InNthreEad_s(nnthrceadsc), tlidInDBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ evFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proup), | ^~~~~~~~~~~ oto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSizeFuncPr(od, half,s NCCL_ALGtO_TREE,e NCCL_PROpTO_SIMPLES, 2) | ^ i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'z 611 | eRunWorkBa_tch, algo,s proto, u[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeofnrol(l>().runT(); \ | ^ )/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | : ste ptid(Size_) tid), nthreads(nthreads), t{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565i | dInBlo ck(thr eadIdx .x), group(group ), | ^~~~~~~~~~~~~~~~~r /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60u: note: field 'group' will be initialized after field 'stepSize' n670 | T tid(tird), ntehreadse(nthreUads), ptidInBDlock(tohreadIdwx.x), ngroup(g, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work);In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREELE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo,, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173 NCCL_ALGO_TREE, NCC: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrNeROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiz/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(te_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | i d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | f (T) : s tepSiz e_ ) { s| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.he:63:56: pnote: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | PrimitivSes, 0, sProto, t0> priems | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hp:558Size_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PR:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested hereO 558 | T runROing(tidLE]/NCCL_STEPS/sizeof(T) : s, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:t12epSize:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | D_) {E | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ F| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hI:303:90N: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here E 303 | _ PnrimitcivesA, /*Dlirect=l*/0,R Proteo, 0> dprimsu | ^ c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565e:5: note: _in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565R | ING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, Func runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tidLL_U(NROLLt>, CiOLL_dUNRO)LL>(,tid, nthnreadts, whork)r; | e ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested herea d432 | s ( if n(tid t< suhbtn)r RuneWorkaColld()l.runo(tidc, sukbtn,( wortk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_hreadSIdxI.x)M, gProuLp(gErou,p), | ^~~~~~~~~~~~~~~~~2 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h):670: 60: note: field 'group' will be initialized after field 'stepSize' | 670^ | t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hid(tid:), 611nth:rea62ds(:nth renote: ads)expanded from macro 'DEFINE_ncclDevFunc', t idI nBloc611k(t | hre adI dx.x ), groRup(ugronup)W, o| ^~~~~~~~~~~ rkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' ll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hif (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Prot:670:15: warning: oinitializer order does not match the declaration order [-Wreorder-ctor] 670 | , tid(ti d), nth0reads(>nthreads) , tidInBlopck(threardIdx.x),i group(gmroup), s| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st epSize(s| tepSize_ ^ == 0 ? /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | step:5 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AlS note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558i | runRzing(tid, nthreads, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hwork):670:;15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432id | (tid), lReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nth reads(n thread s), t if (tid < subtn) RunWorkColl().run(tid, su b stepStize(stenpSize_ ,== 0 ? ncclSwhmem.)oc { | ro ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group km/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:)m254:90: note: ;.in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 b | u Pri| fmiti ^vfes, /C*DiLrect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 1, 2, 4>::run' requested here l22 | DeEFIN ,ste, pSiz ne_)c { c| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ l| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hF:63:uC56OLnL:_UcN AllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4)note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | | Pr^imit ives/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h, 0, Pnote: rotexpanded from macro 'DEFINE_ncclDevFunc'o, 0 > pr ims | ^611 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h | ROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, un:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthrroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreadsea(dns,t whorrk);e a| ^d /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hs):432,:78 : tnote: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herei dI432 | n B l oifc (kt(idt | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthNE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_rfeads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_2, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run();:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.b \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, Ialgo, nprotoB, ulnrollo>().rcun();k \ | ( ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670t:15: note: hfield 'nthreads' will be initialized after field 'tidInBlock' 670r | e tida(tid)d, nIdx.x), group(grothreads(nthreads), tidInBlock(threadIdx.x), group(groupu)p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, ti d(tid)w, nthroeads(rnthreakds), t)idInBlo;ck(thr eadIdx .x), g| roup(g ^roup) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here pSiz e(ste22 | DEFINE_ncclDevFpSizue_ == n0 ? nccclShm(AllReem.commduc.bufe_fSizes[NCCL_PROTO_SIMPLE]/NCCRILNG_S_IMPLE_SProd_fT16_4, EncclFPunS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hcAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here ,254 | rPrimeitivdesric<,NCCL _MAaX_DElV_ARgITY,o 1>,, /*D irepct=*r/0,oto Pr,o unroll>().runto(, 0>) pri;ms | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h\: | ^ 565/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:5: note: :670:15:in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | run note: Tfield 'nthreads' will be initialized after field 'tidInBlock' r670 | e eUpDown, COLL_UNROLtid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)L>(t,id, nthrgeadsr, woork)u; | p ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h(:432:78g: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here r 432 | o u ifp ()tid ,< su btn) Run| Work ^~~~~~~~~~~Coll ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthr18 warnings generated when compiling for host. eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h,:670 :15: warning: work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h tidInBl:o670:15: warning: cinitializer order does not match the declaration order [-Wreorder-ctor] 670 | k tid((tid), ntthreadsh(nthreadrs), tideIn432:78:Ba note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here ld432 | o I if (ctdx.x), grok(thrueadIdxidp. < (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ x), group(group)s,ubtn) RunW orkCo| ll< ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~Fn, T , Red Op, Algo, Pr| oto, tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_COLL_ UNRO 671 | stepSize(steLLp>(S).irun(tzid, seubt | _n tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 , | = st=ewpSi zoe(s0terp Size?_ == k0 ? nn)cclcS;hmecm .col mmShmem.comm.b| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_.bnuffSiczes[NcCCL_lPROTO_DSIMPLeE]/NCvCFunc(AllReduce_RING_SIMPLE_Prod_L_STfEPS/si1zeof(6T) : s_tepSi4ze_) ,{ | ncclFunc ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | PrimiuftfSizies[NvCCL_ePROTO_SIMsPLE], FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here n303 | Asymmetric<1, NCCL_MAX_DEV_ PrimitivesReduc,e, F unc/Prod,* halDf, NiCCL_rALGO_eRINGc, NCtCL_P=ROTO_*SIMP/LE, 4) 0| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h,:611: 62: note: Pexpanded from macro 'DEFINE_ncclDevFunc' r611 | o RuntWorkoBatch,RIrT Y>e,p /d*rDioriepctm=<*s/0, Pr otty>, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:o565,: 05> :pr imnote: sin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeU p565D | o w rnundO,p, PCroOtoLSiLmp_lUe_U(NRtOLiL>d, C,OL L_nUNtROhLLr>(etida,ds, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herent hr ead432s, | w or k) ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h i:432f:78 : note: (in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here t i432 | d < if (tid < subtn) RunWorkCo slubltn<) FRunnW,or kCTol, RedOp, Algo, Proto, COLL_UNROLL>().l().run(tid, subtn, wo, rsukbt)n,; w or k)| ; ^ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:17::1:17 note: :1: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here17 | 17 | DEFINEDE_FInNEc_ncccllDDevFeuvncF(unc(AllReduce_TREE_SIMPLE_Prod_f16_4, nAlclRcelduFceu_TREEn_ScIMAPLlE_lPrRod_ef1d6u_4c, encc,l FunFcAullnRecducPe,r FoundcP,ro d,h haallf,f N,CC L_NALCGCL_ALGO_TO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorREkE,B NCaCLt_PcROhTO<_ScIMoPLlE,l 4,) | t^ y/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h,:611 :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchre,do p,g aolg,o, pprortoo, tunrool,l> unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthr()e.raund()s; (\ n | t ^ h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hr:670e:15a: dnote: field 'nthreads' will be initialized after field 'tidInBlock's )670, | t tiidd(tIidn),B nlthorecadks((ntthrheardes)a, tdidIIndBx.lockx(th)re, agdIrdxo.xu),p g(rougp(rgroouup)p, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h),:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(th | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hup(:670:15:g warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | r tid(otid), nthreadsu(nthreadsp), tidI)nBlo, ck | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(thread(Idx.x), sgroup(groutp)epSize_ == 0 ?, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671n | cclShstepmSiem.ze(scomm.buffteSpSize_ =izes[NCCL_P= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTOROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitivesnote: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | , Primit ive0, Proto, s<0T, RedOp>, F prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, ,0, P roto,C 0> pOrims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hL:558:5L: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here _ 558 | U runRNingedOp, Proto, COLL_UNROLL>(tid, nt(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) Ruhrenads, wWork);o | ^ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432k:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereC 432 | o l if (lll().run(tid, subtn<,Fn, T, RedOp, wAlgoo, Prrotok, CO)LL_UN;ROLL >(). run| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE(t_id, nsubtnc, wocrkl); D | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cppe:22:vFunc(AllReduce_RING_SIM1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_PLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611f | 16_ 4, ncclF uncA llReRduceu, FunncPrWod, ohralfk, NBCCLa_ALtGO_cRINhG, , algo, proto, unroll>().rrkBautcnh , a| lgo ^, p rot/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ho, unr:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | oll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' ),670 nth | rea ds( nth re ads),tid(ti tiddIn)Blo,ck( thrneadtIdxh.x)r, egroaup(dgrosu(nthreads), tidInBlp),o | c ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hk:670:(60: tnote: field 'group' will be initialized after field 'stepSize' h 670r | e taid(dtidI), dnthxrea.ds(xnth)rea,ds) , tgidIrnBloocup(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f16_4, ncclFuncAllReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx90a. [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = reIn file included from cvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 2718 warnings generated when compiling for gfx1200. | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ dx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ unused variable 'bid' [-Wunused-variable] 218 | cons/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ht:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 18 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = reIn file included from cvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hPrimitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.heads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | | stepS ize(ste pSize_t == 0 i? nccldShmem.c(omm.butffSizesi[NCd), nthreads(nthreaCL_PROTO_SIMPLE]/NCCL_STEPS/sids), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_Szeof(T) I: stepMSize_) P{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ L | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hE:303:90: note: ]in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | / NPrimitCives, //*Diresct=*/0i, Prozeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here to, 0 > prims | ^254 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565 | :5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runT reeUpD own, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ own, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rotoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ); \ | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tid11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, //builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h*Direct=*/0, P:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLroto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2)Fn, T, R | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchhre, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ads ), tid InBloc k(thretadIdx.xi), gdroup(gr(oup), t | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | i tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | d ste)p, nthreads(nthreads), tidInBlock(threadIdx.x), grSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSioup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCLx_), gSroup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pro18 warnings generated when compiling for host. to, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid( runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); t| id) ^, nth read/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hs(nthreads), :tidI432nBloc:k(thr78eadId:x.x), grounote: p(grin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereoup) , 432 | if (tid | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ <| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s steupSizbe(sttepSinze_ =)= 0 ? ncRclSuhmemn.comWm.borkColl().run(tid, subtn, wNCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:ork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, 90:f note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here l 303 | o at ,Pri mitNiCvesCR, /O*DiTrecOt=*/_0, SPrIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | oto , 0R> purimsn | W ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ho:r565:5k: note: Bin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here a565 | t c ruhnTr, COLL_UNROLL>(tiy, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:d, nthnote: reafield 'nthreads' will be initialized after field 'tidInBlock'ds, w ork);670 | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h: 432:78 : note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here t432 | i d(tid), nthread sif( (ntitd h, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | eubtan)d Rsun)W,ork Cotlli, /*Direct=up(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkB60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreaOdLLs>(().nrtunh(tride, asdubstn), ,wo rkt);i | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFuncatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (AllReduce_TREEdInBlock(threadIdx.x), group(g_SrIoMPuLEp_P)ro,d_ f3 2_| ^~~~~~~~~~~4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Pri< subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ mitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadgIroup(dgroupx), | . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ x671 | )step,Size(st epSizeg_ == 0r ? nccolShmemup(group.comm).buff, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tSizies[NCdCL_PIROTOn_SIMBPLE]l/NCCoL_STcEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groufloat, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | L_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hS:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_2, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h (tid < subtn) RunWorkColl().run(tid, subtn, work:)670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), unRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:| 22 ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, unroll>().run(); \ | ^:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670(:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), ntthrid), nthreades(nthreaadds(nthreasd), tidIsnBl), ock(threadtidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizIdex.x), gr_oup(gro up), | = ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60=: note: field 'group' will be initialized after field 'stepSize' 670 | ti0d(tid) , nthr?eads(n threadsncclS), htimem.codInmBlockm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ (| thre group(groupad Idx.x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h), group(grou:p),303 | : ^~~~~~~~~~~ 90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f32_4, ncclFuncAllReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: intnote: 32_t dataexpanded from macro 'barrier_by_group'1, flag1, data2, flag229; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h | :145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uin t32_t dat a1, flag1 , data2, cflag2; o | ^~~~~ nst int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175 : uint32_t data1, flag1, data2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp :2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:1529: warning: unused variable 'bid' [-Wunused-variable] | 27 | cons t int bid = ncclcShmemo.chnannelsId - twork-> channielLo;n | ^~~ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29y_group(); | | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | | ^~~~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown(, COLL_UgNROLL>(rtid, ntohreads,u work);p | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h):432:78,: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (t| id < su ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~btn) Ru nWorkCo ll | stepSi().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Alze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | l Reduc e_TRE E_SIM PLE_P rod_fP64_2, rncclFiuncAlmlReduice, FutncProid, dovublee, NCsCL_AL,TO _SIMP/LE, 2*) | ^ D/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62i: note: expanded from macro 'DEFINE_ncclDevFunc' r 611 | e RuncWorkBtatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] nBlock(threadIdx. x670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | ), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> In file included from prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp::2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h5:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173:: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] note: 670 | in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here tid(t id), nt hreads(nt565hreads | ) runTreeUpDow, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_n, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run:254:(90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here t 254 | i Prd,im subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cppitives, /*Direct=*/0, :7:1P: rnote: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested hereoto, 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduc0e> pr,ims | F ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hu:565:n5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested herec 565P | r runoTreedUpD, downo, COL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hL_UN:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkROLBL>(taid, ntthrecadsh, wo, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl , aPlgor, porotto, uonro,ll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads) CO,LL_ UNRtOLLi>()d.ruIn(tind, Bsubltn, oworkc); k | ^ (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cppt:7:h1: readIdx.x), grounote: pin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here (7 | DgEFIrNE_oncculDevFunpc), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthr(AlelReaducde_TsREE(_SInMPLtE_Phrord_f64e_2,a ncdclFusncA)llR,edu ce,t FidInBlock(threuncProd, double, NCCL_ALGO_TREE, aNdICCdLx._x)P, RgrOouTp(Ogr_ouSp)I, M | P ^~~~~~~~~~~ LE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShme/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Dir:ect=*/0, 15Proto, 0> p:rims | ^ warning: initializer order does not match the declaration order [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565: 5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | 670 | tid(tid), nthreads(nthreads), tid mr.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IunTreeUpDonwn, COLL tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize__UNROLL>(t id, nth read671 | stepSize(ss, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(titepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives().rusn(tid, ysubtn, mwork); m | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cppe:7:1: note: tin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7r | DEFINE_incclDevcFunc(Al, /*Direct=*/0,E_Prod_f64_2, ncclFuncAllReduceepSi,ze_ = = 0 F? ncuclShmem.comm.buffSizes[NCCL_PROT OProt_o, 0S> prIims | M ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hP:565:5L: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereE 565] | ncP/ rodN , CCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56 runTreeUpDown, COLL_UNROLL>(tid, nthreads, wdouoble,r NCCkL_AL)GO_TR;EE, NCC L_PR| OTO_ ^SIMP LE,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h::611:62432: note: expanded from macro 'DEFINE_ncclDevFunc': 61178 | : Run Worknote: Batin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) Rcuho, allgol, p()T.ru,n() ; \R | e ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hd:670O:15:p note: field 'nthreads' will be initialized after field 'tidInBlock' , 670 | A tlidg(tiod),, n thrPeadrs(ntohr: note: ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here a63 | d s Pri)mit,ives , 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hto, C:OLL432_UN:RO78LL>:(). runnote: (tiin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hered, sub tn, wo432rk) | ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp: 7:1 : note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | iDEIfFInBlNo Eck_((tnthrcieacddIl dxD<.xe ),vs guFroubupnt(gcnro)(up A),Rl u| l ^~~~~~~~~~~~~~~~~n R/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hWe:670od:60ru: kcnote: field 'group' will be initialized after field 'stepSize'Ce o_670 | T R tiEd(Etid_), SnthIreaMds(PnthLreaEds)_,l Pl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(All | R RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ educe_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, In file included from nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARhPreads, _work); S | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hI:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereZ 432 | E if ()tid < s,ubtn) R unWorkC oll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllRedu| ~~~~~~~~~~~~~~~~~~ c | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507e | warp_InBlockR(threING_SIMPLE_ProdadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSizuffSizes[NCCL_PROTO_f64__2, nLcclFuLncAll1Reduc2e, Fu8ncPro]d, do/uble,N NCCLC_ALGOC_RINGL, NCC_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSTEPSL_PRO/Tsizeof(uintO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611e | (stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, algo, proto, unroll>().r:670:15:u warning: initializer order does not match the declaration order [-Wreorder-ctor] n670 | <1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFu tid(t(id), )nthre;ads(nt hread\s), t idInB lock(| threa ^dIdx. x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hroup(group), : | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 670| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :671 | s15tepSnc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :ize (stepnote: Size6field 'nthreads' will be initialized after field 'tidInBlock'_4_t)) { =| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | = group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :670421:90: | note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here ? 421 | n pctrimsci(tidl, nthSreadsh, treme->doewn, tmree-.>dowcn, woork->smendbumff, w.ork->brecvbuuff, wofrk->rfedOpASrg); i zes[NCCL_PROTO_SIMPLE]/NCC| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h_UNROLL:>(tid565, nt:hread5s, wo:rk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hnote: :432:in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if 565(tid | < su runTreeUpDown().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREotoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, wE_oLL1r28_kPro)d_f;64_ 2, ncc| lFu ^ncA ll/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tReidudce,, Fu ncPsrodu, dboubtle,n NC,CL_ ALGOwork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DE_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ FINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%In file included from W/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nARP_SIZE), warp(tid/threads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gin instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_Troup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, REE, NCCL_PROTO_LL128, 2) | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>():670NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | .run(); \ | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), n t? nhcclrShmeem.acommd.busffSi(zesn[NCtCL_hPRrOTOe_SIaMPLEd]/NsCCL)_ST,EPS/ sizteofi(T)d : sItepnSize_Block(threadIdx.x), group(group), | ^~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611n:Blo62ck(t:hre adInote: dxexpanded from macro 'DEFINE_ncclDevFunc'.x) , g roup(611gro | up) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671R | u stnepSWizeo(strepSkizeB_ =a= 0t ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock,( RetdOph, FranAesymametdri/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] cI, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProdDrEV_oARuITpY,) 1,>, / *D| i ^~~~~~~~~~~~~~~~~re ct/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h=*/:0,670 P:ro60to:, 0>note: pfield 'group' will be initialized after field 'stepSize'ri m s | 670 ^ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :565 :5 : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested heret i565 | d ( t riundTr)ee,Up Donwnt,d CIOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hnBl:oc432k(threadIdx., double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), gro:u78:p note: (in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here g r432 | o up), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[In file included from N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlogrocup), k | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ (| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | t sthepSizer(stepSieze_ ==a 0 ? ncdclShmeImdx.x.)comm, group(.buffSgizeroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s[NC CL_PRO TO_SIM PLE]/N CstepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiCzL_STEPeS/sizesof(T) [: stepNSize_)C { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hL:63:56:_ note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | P ROTO_SIPrimitives, 0, PMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hro:to, 0>303 pri:ms | ^90 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558::5: note: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here P 558 | r runRiingmitives(t<1, NCCid, nthLr_MAX_DEV_ARITY>, /*Direades, worck); | t ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432=:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here*/0, Proto 432 | i, 0> prims f (tid < sub| tn ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h) Ru:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here nWorkColl565 | runTr().run(tieeUpDown, COLL_UNROLL>(tid, nthreads,d, subwtn, oworkr); k| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp):12:;1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEF| INE_ ^nccl DevF/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hunc(AllR:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subedutce_RnING_S)IMPL E_PrRod_fu64_2n, nWccloFuncrAllRkeColl().run(tid, subtn, wALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | o rkRunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAty,l rledoRp,d alugo,c preoto,, un rollF>()u.runn()c; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hP:670r:15:o note: field 'nthreads' will be initialized after field 'tidInBlock'd ,670 | dtido(tuble, NCCL_ALGO_TREE, NCCL_PROTidO), _nthSreaIds(MnthPreaLds),E ti,dIn Blo4ck()thr e adId| x.x^), gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] pSize(stepSize 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatc_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FanAsymmetric<1, NCCL_MAX_DEhV, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, wor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadks(nthrea)ds), tid;InBlock (t h| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllRereadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_STEPS/sizduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLEeof(T,) : st epS2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: izenote: _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:expanded from macro 'DEFINE_ncclDevFunc'90: 611 | RunWorkBatch, note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here a254 | l Primigtives().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670Fan:Asym15metr:ic, /670*Di | rect =*/0, Pro to, 0>t pid(rimtid), nthreads(nthreads s| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h):565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown,n COBLL_UlNROLLo>(ticd, nkthre(ads, work);t | ^h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hr:432:78:eadIdx.x), group( gnote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here r 432o | u p )if ,(ti d < s| ^~~~~~~~~~~ ubtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(tLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, swubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork);hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h double,:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads( NCCL_nALGOt_TREE,h NCCL_PrROTO_SeIMPLE,a 4) | d^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:s611:62: note: )expanded from macro 'DEFINE_ncclDevFunc' 611 | , RunWorkBatch< coll, tity, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Prix),mitives group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hlgo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :n670:15t: warning: initializer order does not match the declaration order [-Wreorder-ctor] h670 | rtid(tid)e, nthraeads(ndthreadss), tidIn,Block( threadIdwx.x), grooup(grorupk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:), 432| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ : 67178: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | i | f step Size(st(epSize_ t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadid < subtn) RunWor== k0 ? nccClShmem.coommll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDs(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ evFunc(AllReduce_R: steIpSize_)N { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ G | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:_63:56: SIMPLE_Prodnote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here _63 | f Prim6itive4s, 0O, Pro_to, 0>S primIs | ^M /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558P:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested hereL 558 | E ru,nRing ^(tid, nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.heads, work:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | RunWorkBatch, algo, proto, unroll>().r u inf (tid( < su)btn) ;RunWor kCol\lfield 'nthreads' will be initialized after field 'tidInBlock'().run (tid 670 | tid(tid), nt, suhbtn, rwork)e; | a ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cppd:s(nthreads), tidInBl22o:1: note: cin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22k | DEFIN(E_nctclDevFhureadIdx.x), gncr(AlloReducue_RINp(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:G_ SIMPLnote: E_Prfield 'group' will be initialized after field 'stepSize'od_f6 4_4, nccl670 | tid(tid)Func,AllRe duce, nFuncProd,threads(nthreads), tidInBlock(threadIdx.x), group(grou double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hs), tidInBlock(threadIdx.x):670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), , ngroutp(grhoup),r | e ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:a670:60:d note: field 'group' will be initialized after field 'stepSize' s 670 | (nthread s t)id(ti,d), nthrteidInBlock(threadIdx.x), grads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_2, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze(stepSize_ == 0 ? ncclShmem.comm.buffSize/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, ws[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, wor 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(Al/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lReduce_TREE_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f64_4, ncclFuncAllReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1201. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp 18 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll1:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 28Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group();In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::218In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ mem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channel22 warnings generated when compiling for gfx90a. Lo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); ROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ :611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_2, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSiz tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_f8_4, ncclFuncAllReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11:218:15: : warning: unused variable 'bid' [-Wunused-variable] 218In file included from | co/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hnst int: bid = 173ncclShmem.channelId - work->channelLo; | ^~~ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ Lo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_T/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ REE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hi:d670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_ProInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreadd_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmems(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_2, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDow/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hn, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, s:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShubtn, wmork); e| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cppm:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here . 17 | DEFINcE_ncclDoevFunc(mAllReducme_TREE_.SIMPLE_bProdu_ffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/su32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' izeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRingo, algto, protoo, unrol,l>().ru n(); \ C | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670O:15: note: field 'nthreads' will be initialized after field 'tidInBlock' L670 | L_UNROLL>(tid, nthr tid(teid),ads nthreads(nthreads), tidInBlock(threadIdx.x), group(group),, wo rk ); | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h ^~~~~~~~~~~~~~~~~: 432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h if :(t670:60: note: field 'group' will be initialized after field 'stepSize' 670id | < subtn) RunWorkColl()nBl.run(tid, subtn, woocrk(thkread)Idx.;x), group (gro| u ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | PrimiedOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, twives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u32_4, ncclFuncAllReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx90a. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group();In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:Id - work->c2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hannelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | em.channelId - work->channelLo; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flbarriear_by_ggroup()1; | ^~~~~~~~~~~~~~~~~~ ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' d29 | aconst intt w = athread2Idx.x/,WARP_SI ZE; \ f | ^ lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h| :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hwarning: :27:15:unused variable 'w' [-Wunused-variable] warning: 80 | barrier_by_gunused variable 'bid' [-Wunused-variable] 27 | r coonst intu bid p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:= ncclShmem.channelId - work->channelLo; | ^~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fla:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = nccg2; l| ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hS:145:28:h warning: unused variable 'data2' [-Wunused-variable] m145 | e uintm32_t .data1c, fhlag1,a data2n, flang2; e| ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hlI:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uinIn file included from td - work->channelLo; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp| ^~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h32_t data1, flag1, data2, flag2; | ^~~~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock', tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_ST E 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nt:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ :670| :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 670 | ti d(tid), nth671reads( | nthre ads), t idInBl ock(th readIdsx.x), tgroup(geroupp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ i671 | zstepSeize(stepShrieads, wzork); e | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h_:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432= | = if (t id < su0btn) R unWork?Coll()..run((btustepfSizefi_ =Sd= 0i, ? z ncceslSsuhm[bem.NtcomCnm.bC,uffL Siz_PRworOkes)T[NC;OC_SIMPLE]/NCC | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDeL_PROTO_SIMvPLE]F/NCCuL_STnEPS/csizeo(f(T)A : sltepSlL_iSRTEzePS/sdeizeu_of(c)T) e: s_{tepT SizR e_| )E { ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ E | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ | group(groupS | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hI group(groupMPLE_Prod_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here:303:90 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | 303P | r Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_MAX_DEV_ARIymTmetY>, /*Direct=*/ric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTre0e, PUrotpo, D0> opriwms n| ^ , ProtoSimple<1, 1, 2>, 2>' requested here R565 | e d ruOnTrpeeU,pDo wn,L CO>LL_U, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorNkROLLC>(toid,l ntlhre, 0, 2, 2>::run' requested here e432d | O p i,f ( tidA < lsubtn) RunWorkColl()go,. Prrotou, CnOLL(tid, subtn, work); | ^ _/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cppUNROLL>(:).r7un(:tid1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here , subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Al7l | DERFINeE_dncuclDcevFuenc(_AlTlReRducEe_TREE_SIMPLE_Prod_u64_2, ncclFuncAllRedEu_SIcMPLeE_P,rod_ uF64_u2, nnccclFuPncAlrlReduce, FuncPordo,d ,u iunitn6t46_4t_,t ,N CNCCLC_LA_LAGLOG_OT_RTEREE,E ,N CNCCLC_LP_RPOTROO_TSOI_MSPILMEP,L E2,) | ^2 )/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :| 611:62^: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hnote: expanded from macro 'DEFINE_ncclDevFunc': 611 :61162 | : note: expanded from macro 'DEFINE_ncclDevFunc' R u611n | W o r k BRautncWhore, daolpgp,r oatlog,o ,u nprrooltlo>,( )un.rroulnl>(()).;r u\n ( )| ; ^ \/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :| 670: ^15 :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h note: :field 'nthreads' will be initialized after field 'tidInBlock'670 : 15:670 | note: field 'nthreads' will be initialized after field 'tidInBlock' 670ti | d ( t i dt)i,d (ntidt)h,r enatdhsr(enatdhsr(enatdhs)r,e atdidsI)nB,l otcikd(ItnhBrealdoIcdkx.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~g r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hou:670:60p:) ,note: field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ 670/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | :670:60 : note: tfield 'group' will be initialized after field 'stepSize'i d (t670i | d ) , nttihdr(etaidds)(,n tnhthrreeaaddss)(,n titdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,ro u p| ( ^~~~~~~~~~~ group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ NROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tids)I, tidInnBlocBk(thrleadIdox.x),c groukp(gr(oup),t | ^~~~~~~~~~~h readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | PrimitivesIn file included from , 0, Proto, 0> prims | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrea /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hd:558:s5: note: )in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here ,558 | trunRiing(tid| , nthreads, w ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ork) ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h| :432: tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | 671if ( | tid < sub tn) RunWorkColl().run(tid, stespSizue(stbepSitze_ n== 0 ,? nc clShwmeom.cormm.bkuffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp,:12:1 : note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here/ 12* | DEFDINE_incclrDevFeunc(cAllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: texpanded from macro 'DEFINE_ncclDevFunc'=*/0 , Pr oto, 0>611 | RunWorprkimsB | ^a /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:t565:5:c note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here h 565 | < rcunTreoeUpDlownSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: , anote: lgoin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here, pro to, unroll>(432).ru | n( if (tid < subtn) RunWo); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thre, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreaaddIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkOLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_Coll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.co/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hmm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ::670: stepSize_15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, 671 | / s*tepSiDzeirect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:(step Size_note: == 0in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here ? n cclSh mem.comm.565buffS | izes [NCCL runTreeUpDownstepS,ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C | group(group O/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, /_*Di/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthrUNROLL>().run(tid, subtrenct=*/,0, Pr oto, w0> proims | r ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:k565:5: note: )in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here ;565 | runT reeUpDown<| T, Re ^dOp,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp: P17rot:o:670:15:S1 warning: initializer order does not match the declaration order [-Wreorder-ctor] i: 670 | m note: ptid(in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereltid) e, n ,c Cgr(oOup)A,L | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~lL lReduce__UNROLTL>(| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_Rt 671i | ds, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: tepnote: Sizin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested heree(s tep SizeEE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _432 == | 0 ? nc clS hme m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ PSeads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cppT) :: st17ep:Siz1e_): { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: | group(groupin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :303 :90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here17 | 303 | D E F PrIimiNtivEes<_T, RnedcclDevFunOp,c Fa(nAsAymmletrlic, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, algo, proto, unroll>().rumplne<1(, 1), C;OLL _UN\ROLL>, CO L| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15L:_UNROLL>( tidnote: , nfield 'nthreads' will be initialized after field 'tidInBlock'thr ead s, wo670rk) | ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432:78 : note: tin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here i432d | ( tid), nthread s if (ti(d < nsubtn) RtunWhorkCrolle().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:o1ck:(th renote: adIin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested heredx. x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nth 7r | DEeFINaE_ndcclsDev), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthrea: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(step/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_S:IMPLE]/NCCL670_STEPS/si:zeof(T) : 15stepSize_:) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303warning: :90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread 303 | s Prim(itives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads)FanAs,ymme tidInBlock(tric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto,threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 0> prim63s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h | :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTree UpDown,y COLL_mUNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | , COLL_rUNROLLu>()nRing(tid, nthreads,.r un(tidw, subton, worrk); | k ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:7):1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here; 7 | D EFINE_ ncclDe| vFunc( ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFexpanded from macro 'DEFINE_ncclDevFunc'u 611 | nc(All R RuenWordkBatuch,NG_SIMPLE_Prod_u64_2, ncclFu algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(threadIdx.x), group(group),:670 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^~~~~~~~~~~~~~~~~670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h tid(:tid670:60: note: field 'group' will be initialized after field 'stepSize' 670 | ti), dnth(reatds(indthr)ea,ds nthreads(nth), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_T), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ REE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllRedgroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tOp, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | ruid(tid)n, nthreRads(nthrieads), ntidInBlgock(thre(tigroup)d, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stenpSize(threads, workstep)Size_ ==; 0 ? | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 nccl | Shmem.c omm.buf fSi if (tid < subtzes[nNCCL_PR)OTO_SIMP LE]R/uNnWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(:90:A note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here l254 | l PrRimitievesN, /*DGirec_t=*/0S, ProIMtPoLE_Prod_u64_4, nccl, 0>F prims | ^u /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565n:5: note: cAllReduce, FuncProd, uinin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565t | 6 runT4reeUp_Down, COLL_UNROLL>(tid, , 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tnithredads,) wor,k); | ^ n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432t:78: hnote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here r 432 | e a ifd (tsid <( subntn) tRunWhorkCroll(I).runn(tiBd, slubtno, wocrk); k | (thr ^ e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cppadIdx.x), group(g:17:r1: onote: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here u 17p | DE)FIN,E_nc cl De| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:vFunc(AllReduce_TREE_SIMPLE_P note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(throreadIdx.x), group(group), | ^~~~~~~~~~~ d_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitive/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hs, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ anSymmetric<1>, 0, Proto, 0> prims | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here: 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | 558 ti | d(tid), nthrea ds(nthr eads), runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 0 ? ncclShmem12.comm. | buffSiDzes[NCCEL_PROTFO_SIMPILE]/NCNCL_STEEPS/size_of(T) n: stcepSizec_) { l| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupD /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63e:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here v 63 | F Pruimitivnes, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRuice_RInNG_SIMPgLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62:< T, Rnote: edOp, Prexpanded from macro 'DEFINE_ncclDevFunc'oto, COLL_ UNRO LL>(ti611d, | nthr eads , w ork) ; | ^R /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:u432:78: nWorkBatch, algo, proto, unnote: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here r 432 | o l ifl (ti>d < (subt)n) R.unWorrkCuoll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_n | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclDevFunc(AllReduce/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_S_RING_SIMPLE_Prod_u6/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAl4_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALlReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ GO_RING, NCCL_PROTO_SIMPLE, 2) | ^IMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(t_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_2, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrol/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmel>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tidm).comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce,, nthreads(nthreads), tidInBlock(threadIdx.x), group( FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algroup), | ^~~~~~~~~~~ go, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UN/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSi_RINGz_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:s[670NCCL:_PROTO_SIMPL15E]/N:CCL _STEnote: PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*field 'nthreads' will be initialized after field 'tidInBlock' D 670 | i tride(tidc), ntthre=ads(*nthr/eads0), ti,dInB locPk(thrreadoIdx.tx), goroup,(gro up),0 | ^~~~~~~~~~~~~~~~~> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670 :60: note: pfield 'group' will be initialized after field 'stepSize' 670r | ims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | run tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, woroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xk)); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou, group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u64_4, ncclFuncAllReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, he/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:a12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.hd:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: ,warning: unused variable 'y' [-Wunused-variable] 77 | uinmt32_t y, ahead, mantnissa; | ^ tissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_bIn file included from y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = SIZEt; \ | ^ hreadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, daIn file included from ta2, flag2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:17475:7: warning: : unused variable 'w' [-Wunused-variable] 75/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h | :barrie75r_by_g:roup()7; | ^~~~~~~~~~~~~~~~~~: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29 :15: note: expanded from macro 'barrier_by_group' warning: 29 | unused variable 'w' [-Wunused-variable] const int w = thre adIdx75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29.x/WAR:P_SIZE;15 \ | ^ : In file included from note: expanded from macro 'barrier_by_group' 29 | constIn file included from int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h2; | ^~~~~: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145145::21:14 warning: unused variable 'flag1' [-Wunused-variable] :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: 145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: | In file included from warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hunused variable 'data1' [-Wunused-variable]:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barri145uer_b | iy_gr nouIn file included from uintt32_t 3dat2_t da1, flIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(a); | ^~~~~~~~~~~~~~~~~~ g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:1 note: expanded from macro 'barrier_by_group' 29, | data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:f75:7: warning: lunused variable 'w' [-Wunused-variable] a constg in75t 1w | = ,th read Id x.x/dW ARaP_ SIZtE; \ a | ^ 2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | const int /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; w | = th ^~~~~read Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cppunused variable 'w' [-Wunused-variable] 80 | b:2a: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27r:15: rwarning: unused variable 'bid' [-Wunused-variable] ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | 27 | const int bid = ncclShmem.channelId - work->channelLoIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = rec warning: vunused variable 'bid' [-Wunused-variable] P218 | t cornst (int 0bid )= nc+clShlmem.lchan1nelI2d 8- woOrk->fchanfnelLos; | e ^~~ t; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cppd:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data2, flag2; | ^~~~~ = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->cIn file included from hannelLo; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2:218:15: warning: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = nc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ clShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) In file included from { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(grou, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFup)n, | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ c | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 (509 | AstepSilze(nccllShmem.Rcomm.beuffSizeds[NCCLu_PROTOc_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work-e_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Ru>nrecWvbuffo, worrk->kredOBpArga, 0*tProtco::MhaxGr, ProtoLL128, 2>' requested here 1070t | y runT,reeS plit, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:T, RedOp, ProtoLL128, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hEE, NCCL_PROTO_LL128, 2) | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hl:611:o62: cnote: expanded from macro 'DEFINE_ncclDevFunc' k 611( | t RuhnWorrkBeatach,, al go,g prrotoo, uunropll>(().grunr();o \ u| ^ p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | rp(group), | ^~~~~~~~~~~ In file included from unTreeUpDown/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp,:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nt hCOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),reads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof( group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInIn file included from Block(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_L_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work);/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2,SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | note: tidin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here(tid), n thread s(nthre432 | if (ads), tidInBlock(threadtid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: Iin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested heredx.x), group(g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 12 671 | | stepSizDe(stepSEize_ ==/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSiF 0 ? ncclIShmem.cNomm.bufEfSze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize_s[NCCL_PRnOTO_ScIMPclDevFunc(AllReduce_RING_SIMPLE_LE]/NPCCL_STEPrS/sizeoof(T) : dstepSize__) {u8_2, ncclFuncAllReduce, FuncP | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ r | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:o303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hered 303 | , Pri mituint8_t, Nives, algo, prmmetroic<1, NtCCL_MAXo_DEV_ARI,TY>, /* Direct=*/u0, Proton, 0> prrims | ^ o/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565ll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | :t5: note: iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565d | ( runtTreeiUpDodwn, sCOLL(_UNRnOLL>t(tidh, nrthreeadsads), tidInBlock(threadIdx.x, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (t), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod>_(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthr e | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, prads(nthreads), tidInBoto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | eads(nt hreads),i tidInBlofck(thread Idx.x), (group(grotup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | d step Size(step().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ of(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | PrimitPivesr, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | imitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' x), 670 | tid(tid), nthreads(nthreads), tidInBglock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | troupid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here PLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Prim558i | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uin:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] t8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'670 | tid(tid), n670thread | s(nthr eads), tid(tid), nthreads(nthreads), tidInBlock(thread tidInBIlockd(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), | group(group g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:r63:56:o note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here u 63 | p P(rimigtivers,, 0, Pro to, | 0> p ^~~~~~~~~~~rims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primiti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ves, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1mmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(t:i note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatcLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ op, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid )| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), ti, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_SL_PROTO_SIMPLE]/NCCL_STEPTS/sizEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ eof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.co/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ X_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work);| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_2, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work);/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock:670:15: (warning: initializer order does not match the declaration order [-Wreorder-ctor] t670 | h tid(trid), nethreaads(nthdreads)I, tidIndBlockx(threa.dIdx.x)x, grou)p(grou,p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ g671 | r stepSioze(stup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid),epSiz e_ == 0n ? nctclShmehm.comm.buffSizers[NCCL_ePROTOads(nthreads), tidInBlock(threadIdx.x),_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: gro up(grnote: oup), in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here | ^~~~~~~~~~~ 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCC>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(ALl_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), :n670:15:t warning: initializer order does not match the declaration order [-Wreorder-ctor] h 670 | r etid(atid)d, ntshrea(ds(nnthretads)h, tirdInBelocka(thrdeadIsdx.x)), g,rou p(grtoupi), d| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ I| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ n671 | B stlepSioze(sctepSkize_( == t0 ? hncclSrhmeme.coamm.budffS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); Idx.x), group(group), | ^~~~~~~~~~~ izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSthhreadsm), tiedInBlomck(thr.eadIdx.cx), grooup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInB | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthre670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thre ifa (tidd < subItn) RdunWorxkColl.().runp(tid,( subgroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | tn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncA steplSize(lstepSRize_ =e= 0 d? ncuclShmecm.comem.buf,fSize s[NCCFL_PROuTO_SIMPLE]/NCCnL_STEcPS/siPzeorod, uint8_f(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollhre(ads)(nt.hreards)u, tnidI(nBltocik(dth, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1:readIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | 18 warnings generated when compiling for host. DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_prod_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Prod_u8_4, ncclFuncAllReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 22 warnings generated when compiling for gfx90a. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ o; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrierIn file included from _by_g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hr:11: In file included from oup(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 75:7: warning: unused variable 'w' [-Wunused-variable] | 75 | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | barrie r_by const i_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128OffsetIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h; | ^~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const iIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nt bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, , /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown<2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthrIn file included from eads, wor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cppk); | ^: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:278: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here : 432 | In file included from if /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h(tid:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti d< subtn() tid),Ru nWorkConll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLEin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12] | DEFINE_/ncclDevNFunc(AlClReduceC_RING_SLIMPLE_Sum__bf16_2S, ncclFTuncAllREeduce,P FuncSuSm, hip_/bsfloizeof(T) : sat16,t NCCL_ALeGpSO_ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R| ING, NC group(groupCL_PROT O_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pro Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown ().ruRn(); e\ | ^d /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670O:15: note: pfield 'nthreads' will be initialized after field 'tidInBlock' 670 | , t id(tiPd), ntrhreados(nthtreadso), tiSdInBloick(thmreple<1, 1, COLL_UNROadILdx.x)L, gro>up(gr,oup), COLL_UNROLL>(tid, nthreads, wor | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xk)); , | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hgroup:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] Idx.x), group( g670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().runroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccl(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g:670r:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] o 670 | u tid(tpid), (nthreadgs(nthrroup), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7ROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloa | tDEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(Bltock(threhadIdx.x)r, group(egroup), a| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ d 671 | IstepSized(stepSizex_ == 0 ?. ncclShmxem.comm.)buffSizes,[NCCL_PR OTO_SIMPgLE]/NCCLr_STEPS/siozeof(T) u: stepSipze_) { (| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hgroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step:63:S56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here i63 | Prizmitives, 0, Proto, 0> prims | : ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: 303note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | : runRin90g, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives(tid,F nthreaads, wnork); A | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: snote: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here y432 | m if metric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Pro(tid < subtn) RunWorkColl prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(OLtL_UNROiLL>().drun(ti, nthreadd,s sub,tn, work)w; o| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cppr:12:k1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here) 12; | DEF INE_ncclD evFu| nc( ^AllR educ/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.he_RING_SIM:PLE_432Sum_:bf16_782, n:ccl Funcnote: Allin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereRedu ce, FuncSum,432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_611 | b In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173 fRunWolrkBaotch, aNlgo,C proCto671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().Lrun(G); \O | ^_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:T670:15:R note: field 'nthreads' will be initialized after field 'tidInBlock' E 670 | E t,id (tidN), nCthreCads(Lnthre_ads)P, tiRdInBOlockT(threOadId_x.xS), gIroupM(groP, RedOpu, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pp)L, | ^~~~~~~~~~~~~~~~~E /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:,670:60: note: field 'group' will be initialized after field 'stepSize' 4) | 670 | tid(tid), nt^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), threaids(ndthreaIds),n tidBInBllock(othrecadIdkx.x)(, grtoup(hgrourp), e | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDIn file included from ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ own, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(ti:d, nthre15ads, wo:rk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: warning: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | initializer order does not match the declaration order [-Wreorder-ctor] if (tid < su btn) RunWorkCo670 | ll().run(tid, subtn, work); tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==P S/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, algRo, proOto, unroLll>()L.run(); >\ | ( ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15t: note: field 'nthreads' will be initialized after field 'tidInBlock' i670 | dtid(tid,), nt hreadsnthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid <(n threadss), tiudInBlobck(thretadIdx.nx), gr)oup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RIN), group(group), | ^~~~~~~~~~~ G, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] eads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCC 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ L_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hNCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: ze_note: ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 432 | if (tid < subtn) RunWorkColl <303 | F Primintives().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: <1,note: NCCLin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here_MAX_ DEV_ ARITY>, 7/*Dir | ect=*D/0, PEroto, F0> priIms | N ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:E5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here_ 565 | n rucnTreecUpDolDevFunc(Awn,llReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h COLL_UNRO:LL>611(tid,: nth62read:s, wo rk); note: | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hexpanded from macro 'DEFINE_ncclDevFunc' 611 | :432:78 : note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | R if (tid < suubtnn) RuWnWorokColrl, algo, proto, unroll>().run(); \ COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRed | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRO/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDe/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hvFunc(AllReduc:670:e15: warning: initializer order does not match the declaration order [-Wreorder-ctor] _670 | T tid(tiRd), nthEreads(nEthreads_), tidInSBlock(tIhrMPLE_Sum_beadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algoize(s,tepSi ze_ =p= 0 ? rncclSohmem.tcomm.obuffS,izes[ NCCL_PuROTO_nSIMPLEr]/NCCoL_STlEPS/slizeof>(T) : (stepS)ize_) .{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r | group(group u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn:63:56:( note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here )63 | ; Prim it\ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(ives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads ) run,Ring (tiod,c nthkreads(, wotrk);h | ^r /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.he:432:78:a note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here d I432 | idf x.x), group(group), | ^~~~~~~~~~~ (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), , /*Direct=*/0, Proto, 0> group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hprims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthread:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); s, work); | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | | if (tRunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tiid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReeduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, duce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL:_670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] S 670 | Ttid(tid), Enthreads(Pnthreads)S, tidInBl/ock(threasdIdx.x), igroup(grozup)eof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s:tepSize(s254tepSize_:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitiv e== 0 ? nscclShmem., /*Direct=*/0, Proto, 0> prims ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_R/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hING_LL128_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hb:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63f16_4, ncclFuncAllReduc | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_2, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g:roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 670 tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | :stepSiz15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | e(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROtid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown<:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid T<, R edOp, sProtouSimplbe<1, 1t, COLnL_UNRO)LL>, COLL_RUNROLuL>(tidn, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17WorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Su:1m: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here_ 17b | DEFfINE_1nccl6DevF_unc(4AllRe,duce _TREnE_SIcMPLEc_Sum_lbf16F_4, uncclFnuncAcllReAduce,l FulncSuRm, heip_bdfloatu16, cNCCLe_ALG,O_T REE,F NCuCL_nPROTcO_SIMSPLE,um, hip_bfloat16, NCCL_ALGO_RING 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo,, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread spro)to, unrol,l>( ).rtun(i); d\ I| ^ n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:B670:15l: note: ofield 'nthreads' will be initialized after field 'tidInBlock' c670 | k (tidt(tihd),r ntehreaadsd(ntIhredadsx), .tidxInBl)ock,(th reagdIdrx.xo), ugropup((grgoupr), o | ^~~~~~~~~~~~~~~~~u /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threap)d, I| ^~~~~~~~~~~~~~~~~ d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670x:60:. note: field 'group' will be initialized after field 'stepSize'x )670 | , group(gro tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf16_4, ncclFuncAllReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx90a. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrie const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmemIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ .channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.cha/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nnelId - work->channeIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hx.x), group(gr:o670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/:s254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested herei 254 | z Primiteives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLe_L) { | > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:t56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here i63 | Primitives, 0, Proto, 0> primd,s nthre ads, w ork); | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h ^:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h | if:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | (ti d < subt n) R unWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_ibf (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ >().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: fin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here8_2, ncc lFuncAllR12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, nccleFduceu, Fnunc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sucm, Arclcl_blfloRat8e, NCCL_AdLGO_TREE,u NCcCL_PReOTO_,SIM PLFE, 2)u | ^n /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hc:611:62S: note: uexpanded from macro 'DEFINE_ncclDevFunc' m611 | , Ru nWorrkBatcch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15_b:floa t8, note: NCCLfield 'nthreads' will be initialized after field 'tidInBlock'_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 62: note: expanded from macro 'DEFINE_ncclDevFunc' 670 | 611 t | id( RunWorkBatch, algo, proto, unroll>t(id)), nt.hreadrs(ntuhreands),( tid)InBl;ock( thre\adId x.x), gro| up(g ^rou p), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670::60670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid),: note: field 'group' will be initialized after field 'stepSize' n 670t | h tidr(tide), nathrdeadss(n(nthreads), tidInBthrleadso), ticdInBklock((ththreadIreadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthr5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTeads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ reeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.co stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINmm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(teid, nth_reads, wor== 0 ?k ); | ^ n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:c432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herec 432 | l iSf (tid h< subtmn)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] RuneWom670. | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ comm.bufrkCfoSizes[NCCL_PROTO_SIMPll ().rusn(tidt, subetn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevpSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*Fu/nc(Al0lRed,uce _RINGP_SIMPrLE_Suom_bf8t_2, ncoclFun,cAllRe duce0, Fun>cSum, rccpl_bfloraims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: t8,in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here NCC L_ALGO_RING , NCCL_565PROT | O_SI MPL runTreeUpDown, algo, proto, unroll>().runLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here ()432; \ | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid( tif (tid < subtn) RunWorkColl), ti(dI)nBlo.ck(trhreaudIdx.nx), (groutp(groiup),d | ^~~~~~~~~~~~~~~~~, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 670:60:s note: ufield 'group' will be initialized after field 'stepSize' btn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: 670 | note: tid(in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested heretid) , nt hreads(nt7hrea | DEFINE_ncclDevds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid\ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h671 | stepSize(stepS:670:15i: warning: initializer order does not match the declaration order [-Wreorder-ctor] z670 | e ti_d(tid) , nth=reads=(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | PrimitivesL_PRO,TO_ SIMPL0E]/NC,CL_S TEPS/sPizreof(To) : tstepSoize_), { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h0:63> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runR:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prinig(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :558: 5:i note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here f 558 | (runRting(teid, dnthrOeadps, wo,rk); Algo, Proto, COLL_UNROLL>().run(tid, subt n| ^, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h: 432:78:w note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here o 432 | r k if) (ti;d < subtn ) Ru| nWor ^kCo l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:l1, 1, 2, 2>::run' requested here 12 | DEFINE_RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, wncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().runncAllReduce,o rkF); | ^ u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cppn:12c:1:S note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereu m12 | DE,FI NrE_nccclDcevFlunc_(Alblfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^Reduc e_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hRING_SI:MPL611E_S:um_62bf:8_2 , ncnote: clexpanded from macro 'DEFINE_ncclDevFunc'Fun cAl lRedu611ce, | Fu ncS um, rc cl_Rbfluoatn8, WorkBatch, algo, proto, uO_nSIMrPLEo, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' ll >(611 | RunWorkBatchr, aolguo, pprot)o, ,un roll >()| .run ^~~~~~~~~~~~~~~~~(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^ :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670670:15:: 60note: field 'nthreads' will be initialized after field 'tidInBlock' : 670 | note: tfield 'group' will be initialized after field 'stepSize'id( tid ), nth670reads(n | th rteaids)d, (tti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreadsd(InBlnockt(thhreardIdex.ads), tidInBlock(threax),d grIoup(dgrxoup.), x | ^~~~~~~~~~~~~~~~~) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h,:670:60 : note: gfield 'group' will be initialized after field 'stepSize' r670oup(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclhreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | i d), nthr eads(nth reads), t idInBlocik(In file included from threafdIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roup((groutid < p), s| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ubtn) RunWorkCol | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ l().run(tid, subtn, em.cwomm.bufofSizesr[NCCL_kPROTO_)SIMPLE;]/ NC| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, 254:90:N note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here C254 | C PrimiLtives<_T, RedPOp, FanRAsymmeOtric, /*IDirectM=*/0, PProLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | t o, 0> p RunWorkBatch, ProtoSimple<1, 1, 4>, 4>' requested here > 565 | , runTareelUpDowgn, COLL_UNROLL>(tid, nthreproto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), nthre 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), grouads,p wor(k); g | r ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:432:u78: pnote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here )432 | , if ( tid| < s ^~~~~~~~~~~~~~~~~ubtn ) Ru/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hnWorkCol:l().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8group(group), | ^~~~~~~~~~~ _4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hgroup),:670: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | 670 | ^~~~~~~~~~~~~~~~~ t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xid(t)id),, nth readgs(ntrhreaods),u tidpInBl(ock(gthreradIdox.x),u groupp(gr)oup,), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_2, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] p670 | tSid(tid), inthreads(znthreads),e tidInBlo(ck(threadsIdx.x), grtoup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : steepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, PropSitze_) {o | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDowno prims t | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ho:565:5: note: Sin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565impl | e /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h<1, 1,r unTreeCOUpLL_UNROLLDown, COLL:670:15:_ warning: initializer order does not match the declaration order [-Wreorder-ctor] U670 | N tidR(tid),O nthrLeads(Lnthre>ads), (tidInt>, BCiOLlL_dUNoRO,LLc>( tidk, nnt(hrteatdshh, rwrorke)e; a| a ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hdd:432:sI78: note: ,din instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here x432 | w. ixof (t)ridk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here , group(432group | ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DPLEE]/FNCCIL_STNEPSE/si_zenofcclDevFunc(Al(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDow_PnROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unrop, (Pro)toS.implre, COLL_UNROLL>(ti; d\ ,| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hn:670:t15: hnote: field 'nthreads' will be initialized after field 'tidInBlock' r e670 | a dtisd(t, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereid) , n thread432s(n | thr ead s), if (tid < sub ttidInBnloc)k(t hrReaduIdxn.xW), gorourp(gkrColl().run(tid, subtn, work); | ^ oup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]al go, proto,:670 | u670n r:o 15ll>().run( ); \ t| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hi:670d:15:( note: field 'nthreads' will be initialized after field 'tidInBlock't i670 | d ) ti,d(t id)n, nthreadst(nthhrerades),a tiddInBslock((thnreatdIdhx.xr), egroaup(dgros:u )warning: p,initializer order does not match the declaration order [-Wreorder-ctor] 670t) | i, d tI id(| tid ^~~~~~~~~~~~~~~~~), nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hhreads:(nt670hre:ads60), :ti dInnote: Blofield 'group' will be initialized after field 'stepSize'ck( ntB hlocrk670(et | ha reda dII ddx .xxt).,i xgdr)o(u,ptid), nthreads(nt grhoupr(greoupa), d | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ s | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) (,g671r o | upt )i, d | I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_Bs lt671oe | cp kS s(itetzpSheizr(e(sesteatpSdeizIpe_dS =xi=.zx),e_ == 0 ? ncclShmem.comm.buf 0 f? nSccliShmzem.ecomsm.b[uffNSizCes[CNCCLL_P_ROTPO gR_roOSupTI(gOMro_PupSL),IE M]| ^~~~~~~~~~~ P/LE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primiti | v group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.he:63s:56: , FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here T 63, | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558RedO | p , F anA sym mrunRinge,t /o*D,ir ecCt=O*/L0,L Pr_otUo,N 0R> OprLimLs > | ( ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ht:i565:d5:, note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here n t565 | h r eruanTdreseU,pD owwn, 1, 2, 4>::run' requested here 432 | if (tid < subtn) ReRdOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads,u nWworokCrollknote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here( )432 | . r u nif( /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | (tiDd E< FsuIbNtnE) _RunnWcorckClolDlN()G.r_uSn(ItiMd,P sLubEtn_, Swourkm)_bf8_4, ncclFuncAllRed;u c| ^ e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp,:17 :1F: unote: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested heren c17S | DuEFmIN,E_ ncrclcDevcFulnc_(AbllfRelduocea_TRtE8, E_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuNCnCL_cALSGOu_RmING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | , rRccul_nbWflooart8k, BNCaCLt_AcLGOh_T, algo, proto, unroll>().ru, nNC(CL_)PR;OT O_\SI MP LE| , ^ 4 ) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h::670611::62:15 note: :expanded from macro 'DEFINE_ncclDevFunc' 611note: | field 'nthreads' will be initialized after field 'tidInBlock' R unWo670rk | Ba tc h , anlgto,hreads(nthreads), tidInBl oprcotko,( utnrholrl>e()a.rdunI()d; x\ . | ^x /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h):,670: 15:g note: rfield 'nthreads' will be initialized after field 'tidInBlock' o u670 | p( tid(tid), nthreads(nthreads), tidIgnroBupl)o, c| ^~~~~~~~~~~~~~~~~k /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreadthsre)ad,Id x.tx)i, dgrIounp(Bgrlouop)c,k(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hunroll>().:run(670); :\ | ^ 15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670::15: note: field 'nthreads' will be initialized after field 'tidInBlock' warning: 670 | initializer order does not match the declaration order [-Wreorder-ctor] tid (tid) , nt670 | tid(tihrdeads()nthre,ads), tidInnBlockt(threhadIdx.x), grroupeads(nthreads()group,), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670tidInBlock(threadIdx:.60: note: field 'group' will be initialized after field 'stepSize'x 670) | t,id( group(tigd), rnthreoads(unp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(g | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup):, | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(t, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | id ), nthre ads(nthrteads), tiidInBlocdk(thread(Idx.x), tgroup(grioup), d| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) 671 | s,tepSize( stepSinthreads(nthzre_ == 0 e? nccadslSh), tidInBlomem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSizck(threaedIdx.x_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h), group(group), | ^~~~~~~~~~~~~~~~~: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60303: note: field 'group' will be initialized after field 'stepSize' :670 | ti90d(tid),: nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(t:670:15: hwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | r tid(teid)adIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepS note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here i 303 | z Primeitives, S/*DirIect=*/0, ProtoM, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tPLEi]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto,d, n0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: thrnote: eadsin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here, w ork) 565 | runTree; | ^U /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hp:432:78D: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereo 432w | if n(tid< < sTubtn,) Ru nWorkRColle().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllRdOep, PdrotouSimplce<1,e 1, _COLTREE_SIMPLL_UENROL_L>, SCOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | iufm_bf 8_4,( ncctlFunicAlldRedu ce, rcc(l_bf)lo.arun(tid, subtn, work); | t ^8, NC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cppCL_ALGO_T:REE17, N:CC1L_P:ROT O_Snote: IMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here note: 17field 'nthreads' will be initialized after field 'tidInBlock' | DE FINE _ncclD670evF | unc (Al lRed uce _TRtEE_iSd(IMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloatti8d),, n thrNeaCdsC(ntLhre_adsA), LtidGInBOlock_(thTreaRdIdEx.xE), ,gro up(NgroCup)C, L| ^~~~~~~~~~~~~~~~~ _/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:P670:60ROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWor: knote: field 'group' will be initialized after field 'stepSize' B 670 | a t ticd(thid)<, ncthroeadls(nlthr,ea dst), ytid,InB locrk(tehrdeadoIdx.px),< gty>, algoroup(group), | ^~~~~~~~~~~ , proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_bf8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_bf8_4, ncclFuncAllReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx90a. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrierIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ _by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const iIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ nt bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ o; | ^~~ int w = threadIdx.x/WARP_SIZE;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: 145 | unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ , flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15:up() ; | ^~~~~~~~~~~~~~~~~~ warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: unused variable 'bid' [-Wunused-variable]note: expanded from macro 'barrier_by_group' 29 | co nst int w = t218hreadId | x.x/WAR P_SIZE; \ | ^ In file included from const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1,:366: 15: warning: unused variable 'bid' [-Wunused-variable] f366 | l const aint bidg = ncc1, data2, lShfmem.chalnag2; | ^~~~~ nel/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hId - wor:145:35: warning: unused variable 'flag2' [-Wunused-variable] k->c hannelLo145 | ; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174-: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75: work7: warning: ->chanunused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARnelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hP_SIZE; \ | ^ In file included from :366:15: warning: unused variable 'bid' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:In file included from 7:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] note: 670 | in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested heretid(tid ), nthre ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.7 | buffSizes[NCCL_PROTDEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBaO_SItMPLE]c/Nh { | , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ha:254:90: lnote: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here g254 | o Pri,mitiv es, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreadUpDsown<)T, R,edOp , PrtotoSiimplde<1, I1, CnOLL_BUNROlLL>,o COLcL_UNkR(threadIdx.xOLL)>(, group(group), tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().r| u ^~~~~~~~~~~ n(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:_PROTO_SIMPLE]/N15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | PLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | tree->down, tree->down, work ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 7 670 | | tiDd(tidE), ntFhreadIs(nthrNeadEs), ti_dInBlnock(tchreadIcdx.x)l, groDup(groeup), v | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ F| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ u671 | nstepScize(s(tepSiAze_ =l= 0 l? ncclRShmeme.comm.dbuffSuizes[NcCCLe_TREE_S_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) IMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROT{O | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ | group(groupS /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hI:303:90:M note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here P 303 | L E Pr,imi tive2s, algo, protnoAsym,metr ic<1u, NCnCL_MrAX_DoEV_AlRITlY>, >/*Di(rect)=*/0., Prroto, u0> pnrims( | ^) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:;565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here \565 | In file included from | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | trunTreeUpDown: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h,:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:C175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hO:508:L29: warning: Lfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] _506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), id (ti| d), ~~~~~~~~~~~~~~~~~~ nt hre ads| (nt stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)hre ad s), 507tid | In Blo ck( thre adIwdxa.x)r, group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:U670NR:OLL60>(t:id, ntnote: hrefield 'group' will be initialized after field 'stepSize'ads, wo rk); 670| ^ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432: 78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here t432 | i d i(f (ttpiiddIn )Bl<,o ck(nsthturehbadrtIdenx.a) x/WRARPdu_SsnIZ(WE)no, tr | hk ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ rC | eo warp(tid/WARP_SIZE al 508dl | s< )F fn, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIlagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: MPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, halnote: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | f , pNrCiCmLs_(tiAdL,G On_tThRrEeEa,d sNSCpClLi_tP,R OtTrO_eSeI-M>PdLoEwn,, &2t)r e e| -^ >/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hup,: 611w:62o:r knote: -expanded from macro 'DEFINE_ncclDevFunc'> s en611d | b u f f ,R uwnoWrokr-k>BraetccvhbrreeddoOpp,, 0a*lPgroo,t op:r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown:oMtaox,G ruonurpoWlild>t(h)).;r u n| ( ^) ;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h \: 1070 :| 5 ^: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hnote: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here : 670:107015 | : note: field 'nthreads' will be initialized after field 'tidInBlock'r u nT670r | e e Sp tliidt(l(otcik,d COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (th,r enatdhIrdexa.dxs),, gworrk)o;u p (gro| up ^) ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h | : ^~~~~~~~~~~~~~~~~432 :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h78:: 670note: :in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here60 :432 | note: field 'group' will be initialized after field 'stepSize' 670 | i f t(itdi(dt ir(oup), | ) ^~~~~~~~~~~. run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hMPLE]/NCCL_STE:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Prim iRedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCiLdInBlo_ck(thPreadIRdx.x)O, gTroup(Ogroup_), | ^~~~~~~~~~~S IMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670eads), ti:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] dIn 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, hBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ alf, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15PLE]/NCCL_STEPS: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 /sizeof(T) : stepSi? ncze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /S*Direict=*/z0, Preoto, (0> prsims t| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.he:565:5p: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primi 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc( tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Diwrect=*/a0, Protro, 0> pprims | I ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565n:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here B 565 | lrunTreock(threadIdx.x/WAeURpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if f(tid , FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56 su:btn) RunWonote: rkColin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested herel().r un(tPid, surbtn,i workm); | ^i /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpptives, 0, 2, 4>::run' requested here 17 | DEFFIanSymmetric<1>,NE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, hal 0, Proto,f 0> pr,ims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hNCCL_ALGO_TREE:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subt, NnCC) RunWorkCollL_P, algo,A prolto, ungrollo>(), Proto.,run() ; \ COLL_UNROLL> | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nth(r).rune(tid,a subtdn, worsk); (| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:nthreads), 10:1t: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here i 10dInBlock(threadIdx.x), gr | oDEFIuNE_ncpclDevF(unc(AgllRedruce_RoIup), NG_LL128_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*DirecIn file included from t=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown,S COLLi_UNROLLz>(tide, nt(hreadss, wtork)e; | ^p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:S432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herei z432 | e if_ (tid < sub=tn) R=unWor kColl0()m.rum.buffSizes[NCCL_PROTO_n(tiSd, suIbtn, work);M | ^ P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp::17L670:15::E warning: initializer order does not match the declaration order [-Wreorder-ctor] 1] 670 | :/ Ntid(note: Ctidin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here)C, n tLhr _eads(Snthre17Tads | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EPS/siz), tidInBlock(threadIdx.x), group(greooup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] stfe(T)p : Ssteip 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze(stepSize_ == 0 ? ncclShmem.comm.buffSizSizee_) s{ [| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ N| group(group C/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hC:63:L56: _note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here P 63 | R O PrTimiOtiv_esN, 0C, PCrotLo, _0> SpriTms E | P ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hS:558:/5: snote: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here i 558z | e rounRfing((tid, nthreiaze_d) s{ | , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:o303:90r: note: kin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | ) ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hPrimiti:432:78: note: ves, /*Direct=*/0,in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432P | r o t ifo ,(ti d <0 su>btn ) RpunWrorkiCollm, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2,OL L>,n COcLL_cUNRlOLLF>(tuid,n ntchreaAds,l wolrk)Reduce, Func; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollS ( Ru)nWo.run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllrkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidIReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ?thre ads(nthrneads), tcidInBlocck(threadIldx.x), grShmem.cooup(gromup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | . stepSibze(stepuSizffSizes[NCCL_PROTO_SIMe_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) {PLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hprims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthrea | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: :in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid ds, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, w RoedOp,r Algok, Pro)to, C;OLL_U NROLL >().ru| n(tid ^, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cppsub:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFtnI, worNk); E | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp_ncclDevFunc(A:12l:1: note: lin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12R | DEFIeduce_RING_SIMPLE_SuNmE_nccl_DevFufnc(Al1lRe6_2,duce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, h ncaclFulncAlflRed,uc e, FNuncSCum, hCalf,L NCC_L_ALAGO_RLING, NCCLG_O_RINGPROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWor, NCkCL_PBROTO_aSIMPtLE, c2) h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h<:c611:62o: note: lexpanded from macro 'DEFINE_ncclDevFunc' 611l | , Ru ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:n15Work:Batc h, a670lgo, | pro to, unrol l>().run( tid(tid), nth);r \ e | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIadds(xnth.reaxds)), t,idI nBlgockr(thoreaudIdpx.x(), ggrroupo(gruoupp), ) | ^~~~~~~~~~~~~~~~~, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread s| ^~~~~~~~~~~~~~~~~ (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:n670:60:t note: field 'group' will be initialized after field 'stepSize'h 670r | e tiad(tdid)s, nt)hre,ads (ntthreiadsdInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_2, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threazedof(T) : IstepSize_d) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | x group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:.56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | x Primi)tives,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s 0, Protto, 0epSi> pze(sterpimsSiz | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.he_ :== 0 ? ncclS558:5: hnote: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | m e m.comm.buffSrunRing(tid, nthreads, work); | ^CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, nccl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] F 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreacde_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc's(nt 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hk(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(th:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] readIdx.x), g670 | tid(tid), nthrorup(group),e | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ a ds(nthreads), 671t | steipSize(sdInBlotepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | RedOp, FanAsymmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :if (ti432d < su:btn) 78RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSu: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1:m, hanote: lfin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here, NC CL_A LGO_RING17, NC | CL_PRDOTO_ESIMFPLE,I 4) N | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62E_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum: note: _expanded from macro 'DEFINE_ncclDevFunc' 611f | 1 Run6WorkB_atch4, allgoFuncAllReduce,, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrea dFunscSu(m, hnaltf, hNCCrL_AeLGO_TREEa, NdCCLs_PR)OTO, tidInBlock(_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(ntrhoupr), e | ^~~~~~~~~~~~~~~~~a /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hd:670:60s: note: )field 'group' will be initialized after field 'stepSize' ,670 | t tiid(dtidI), nnthrBeadsl(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthr:670:15:e warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | a tid(tdid), nthrseads(nthrea,ds), ti dInBlockw(thorrek); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:adIdx.x),432 grou:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | ifp(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : ste (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, pSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTrhalf, NCCL_ALGO_TReEE, NCCeL_PROTUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreaL_UNRdOLL>, sCOLL_(UNROLnL>(ttid, nthhreads, work); r | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.he:432:78a:ds), tidInBl note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINEock(thr_eadIdnx.x),c grocup(grloup),D | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ evFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f16.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f16_4, ncclFuncAllReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx90a. 22 warnings generated when compiling for gfx90a. [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx1201. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid In file included from = ncclShmem.channe/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hl:27:15:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Id - work->channelLo; | ^~~ warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShIn file included from mem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier const int b_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ onst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groupIn file included from (group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threa), gdroup(gIroup),d | x ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_2, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] LE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f32_4, ncclFuncAllReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, fIn file included from lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' expanded from macro 'barrier_by_group' 29 | const int w = 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ elId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channeIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ lLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ cclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.chann const int w = threadIdx.x/WARP_SIZE; \ | ^ elId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ tr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from int/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hflag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h::145:21: 218warning: unused variable 'flag1' [-Wunused-variable] 145 | : ui15nt32_t :data1, flag1, warning: data2, unused variable 'bid' [-Wunused-variable]flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h :145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | 218 | const int bid uin t32_t d=ata1, flncagc1lShmem.ch, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagannelId - work->channelLo; | ^~~ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreid(tid), nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(steIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | pSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE Primitives, /*Direct=*/0, Proto, 0> prims | ^ ]/NCCL_STEPS/sizeof(T) : stepSize_) { | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303ll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bu(threadIdx.x), group(group), | ^~~~~~~~~~~ ffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm 432 | em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] if (tid < subtn) RunWor670kColl().trun(tid, isubtn, wordk); | ^( /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1t: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINEid), nthreads_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncS(unthreamds), t,idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thrPrimitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreeaadIdx.dx), gsroup(,group ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:w note: field 'group' will be initialized after field 'stepSize' o670 | rk); tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direcs[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSi:670:15: warning: zinitializer order does not match the declaration order [-Wreorder-ctor] 670 | e tid(tid)_, nthreads( nthreads),= tidInB=lock(threa dIdx.x), 0group(grou p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ?| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(nstepSize_ c== 0clShmem.comm.buffSizes ? [ncclShmemN.comm.bufCfSizesCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T[NCCL_PRO)TO_SIMPLE ]/NCCL_S:TEPS/si zeofstep(T) : stepSize/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tidS(ize_) { t | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:i63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here d63 | Pri)mitives, 0, tProto, 0>h prims r| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5e: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558a | runRidng_) { | s< ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h(T:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested heren, 254 | t PrimihRtives, /*Di,Prect=*/0,r Proto, 0o> prims t| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5o, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | i: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCf C(tid L< sub_tn) RPunWorkRColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here, COLL _UNR OLL>().ru63n(tid | , sub tn, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, doubldOp, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | ife, NCC(L_AtLGOi_RIdNG, NC, alTgo, ,pro to,R unerodll>(O).rpun(,); \ A| ^ l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hgo, Pr:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), ogto,r COoLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hC:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSiCL_ALGO_RING,ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_ NCPROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' CL_PROTO_SIMPLEu,p(g rou2p),) | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670| :60:^ note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h670 | :tid611(ti:d),62 nt:hre adsnote: (ntexpanded from macro 'DEFINE_ncclDevFunc'hrea ds) , tidI611nB | loc k(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkBatch, algo, prot670o | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, wor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nck); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h), nt:670h:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] r670 | teid(tid), anthreadds(nthreadss), ti(dInBlock(nthreadIdxt.x), grohup(groupr), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ e | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | a stepSdize(stepsSize_ == )0 ? ncclShmem.comm.buffSizes[NCCL_, wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)=P=ROTO_S3IMPLE]/N)CCL_STEP,S/sizeo f(T) :g stepSirze_) { o | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hu:254:90p: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here (254 | gPrimitirves, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDownTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runT, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:reeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:4321: :note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 7817 | D:EFIN E_ncnote: clDein instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested herevFun c(Al lReduce_432TREE_SIM | PL E_Su m_ if (tid < subtn) RunWorkColl().f64_r4, nucn(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here clF uncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | ,R uunronll>(W).roun(r); k\ B| ^ a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ht:670c:15:h note: field 'nthreads' will be initialized after field 'tidInBlock'< c670 | o tlid(ltid,), n thrtey, redop,ad s(anthlregados),, tidpInrBlooctk(oth,rea dIudxn.xr),o glrolup>(g(ro)up), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | .run(); \ | ^ ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclT, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), DnevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x),LE , g4)r o| ^u /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hp:(611:62g: rnote: expanded from macro 'DEFINE_ncclDevFunc'o u611p | ) , R un Wo| rk ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~Ba tc h<| co tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ll , ty,671 r | e do p< ty >,s atlgeo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60pS:iz enote: (sfield 'group' will be initialized after field 'stepSize'te pS ize_670 == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCC | tLid_(tSidT),E nPthSre/adssi(nztheroefad(sT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, P), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(A:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().id)r, nthreuads(nnthread(s), ti)dInBlo;ck(thr eadIdx.\x), gr oup(gr oup), | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ ^ 671 | st/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hepSize(stepSize_ :== 0 ?670 ncclS:hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: note: field 'group' will be initialized after field 'stepSize' 670 | tid(t:i670:d15: ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here t 63 | e PprimiStiveis, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl<= 0 ? ncclShmem.comm.b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : s= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | PrimitivesUNR,OLL >()0.,run( tiPdr, soubttn, owor,k); | 0 ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp>:12 :1:p note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herer i12 | mDEFsINE _nc clD| evF ^unc (Al/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hlReduc:e_R558ING:_SI5MPL:E_Su m_fnote: 64_in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here2, nc clFunc558All | Re duce , F unc Sumr, duoubnle,R NCiCngL_ALGO_RING, NCCL_PROTO_SIMPL(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid E, 2) < | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hs:611u:62:b note: expanded from macro 'DEFINE_ncclDevFunc't 611n | ) R unWRorkuBatnch,l al(,). ruRn()e; dOp, Algo, Proto, COLL_UNROLL>(\ ) | ^. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hr:670:u15: nnote: field 'nthreads' will be initialized after field 'tidInBlock' ( 670 | t i tidd(t,id) , nsthrueadsb(tnthnrea,ds), twork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFIidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: NE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' field 'group' will be initialized after field 'stepSize' 670611 | | t id( tid) , nthrReadus(nnthrWeados),r tikdInBBlock(thareatdIdcx.xh), , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(ti:670:15: warning: dinitializer order does not match the declaration order [-Wreorder-ctor] 670 | , tid(tid) , nthreands(nthreatds), tidhInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work)ize;of(T) : step Size_)| { | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | : Prim17itives:, 0, 2, 4>::run' requested hereL_MAX_ DEV_AR ITY>, 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_/*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, algo, proCtOLL_oUNROL,L>, COLLu_UNnROLLr>(tiod, nlthrelads,> wor(k); ) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h.:432r:78: note: uin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here n432 | ( )if (;tid < su\btn) Run Work| Col ^l(a).rdun(stid(, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFnthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreaudnc(sAll(Rednuce_tTREhE_SIrMPeLE_aSumd_f6s4_4), n,ccl FuntcAlilRedducIe,n FuBncSlum,o docublke, (NCthreadIdx.x), group(group), | ^~~~~~~~~~~ CL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCol:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(l().run(tid,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCLnote: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO__PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE, NCCL_PROTO_SIT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives) ,| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:/611:62:* note: expanded from macro 'DEFINE_ncclDevFunc'D i611 | r ReunWcorktBat=ch<*col/l0, ty,, rePdopr, talgoo, ,pr 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' OLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl()670 | . r tiud(tnid)(, ntthreiadsd(nt,hre adss), utidbInBtlockn(th,rea dIdwx.xo), rgrokup()gro;up), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.htidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid ) tid(t,id), nt hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtnt,id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TRE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670E:_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PRIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nOTO_SIMPLEthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ , 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), , COLL_UNROLL>().run(tid, subt n, wor| ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlocIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_2, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | k (thread Idx.x) , groutp(grouip), | ^~~~~~~~~~~~~~~~~d /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:(60: note: field 'group' will be initialized after field 'stepSize' t 670 | i tid(tdid), nt)hreads,(nthreads ), tnidInBlotck(thrheadIdx.rx), greoup(graoup),d | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here l, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tid 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)InBlock(threadIdx.x), group(group),, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f64_4, ncclFuncAllReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx90a. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const iIn file included from n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ht:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hw:75:7: warning: = threadIdxunused variable 'w' [-Wunused-variable] . 75 | baxrrier/_by_gWroup(A);RP_SIZE; \ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: | ^ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174o: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:u7: warning: unused variable 'w' [-Wunused-variable] p 75(); | | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 :uint3282_t :data1 , flawarning: g1, dunused variable 'data2' [-Wunused-variable]ata2, flag 2; | ^~~~~ 145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 | :21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3 2_t duata1,i flag1n, datta2, fl3ag2;2 | ^~~~~ _/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28:t warning: unused variable 'data2' [-Wunused-variable] 145 | data1, flag1, data2, f uint3l2_t daata1,g fla2g1, da;ta | ^~~~~2, f /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | 35 : warning: unused variable 'flag2' [-Wunused-variable] u145 | i nt32_t data1, fl uint32_t data1, flag1, data2, flag2; | ^~~~~ ag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_byIn file included from _group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp):2: In file included from ;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | 11 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hc:80:o5: nwarning: unused variable 'w' [-Wunused-variable] s 80 | t bairrinert_b y_wgrou p()=; th r| ^~~~~~~~~~~~~~~~~~ e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = rIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: ecvPtr(0expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ )+ll128Offset; | :2: ^~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15:. channelIwarning: d -unused variable 'bid' [-Wunused-variable] wo 27 | const int bid = ncclShmem.channelId - rk-w>choannrelLko-;>c | h ^~~ annelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.c:218h:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.cannhelIad -n wonrk-e>chlannIelLo; | ^~~ d - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - wo:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | rk->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: In file included from warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hi:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->chan145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ nelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_:218b:15: warning: yunused variable 'bid' [-Wunused-variable] _218 | g cornst oint up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | b id = nc clShmcem.cohannnelIds - wtork- >cint w = thrhanneelLoa; | d ^~~ Idx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE),In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp15:: warning: initializer order does not match the declaration order [-Wreorder-ctor] 2: 670In file included from | tid(tid)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h,:11 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hn:175t: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hh:508r:29e: awarning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] d s(506 | n threads), tidInBlock(thre a tdidI(tdidx),. nxth)re,ad s(gntrhroeaudsp),( wgid(rtiod%uWApRP)_S,IZ E )| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste, warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpIpSnizBe(lstoepScizke_( == 0t ?h nrccealSdhImedm.xco.mmx.b/ufWfSAizRP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ es [N| CC warp(tid/WARP_SIZEL_ PR OTO508_SIMP | LE ]/ NC CL _flagThread((tid%4)==3), groupS(TEgPSr/soizueopf()T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90:, | note: ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | 254 | st ep Primitives, /*DiErPSe/sizeof(uint64ct_=*/t0,)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | P ro to , 0>p prrimims s | ^( /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ht:i565:d5:- note: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here t 565h | r ea rdusnTSrepeUlpDiotwn,up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ p, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtIn file included from n) RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.halgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid):670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidI(ngroup), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncthSreauds(mnthr,ead s),r ticdIncBlolck(thread_Idxf.x),l grooup(agrtoup)8, ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ N 671C | C sLtep_SizAe(stepSizeL_ =G= 0O ? _ncclRShmIem.NcomGm.,bu fNfSizCes[CNCCLL_P_ROTPO_SIRMPLOE]/TNCCOL_S_TEPSS/sIizeMof(PT) L: sEtep,Size _) 2{) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | :611 Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rcclIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threa_dfloat8, NCCL_ALGO_TREIE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:own, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduc670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] tive670s,i 0,e, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d Pro(to, t0> pirimsd | ^) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:,558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here n 558t | h rrunRieads(nthreads), ting(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RundWInBloock(rthrekadIdxC.x)o, glroulp(gr, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereAlg o, Proto63, C | OLL _UN ROL L>( ).rPun(rtidi, smubtin,t worik);v | e ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpps:<12:1T: note: ,in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2)RedOp, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlockC(OLtL_hUNrROeLaLd>(I).druxn(.tixd,) s,ub tng, rwoorku);p (| ^g /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cppr:o12:u1:p note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here) ,12 | DEFIN E_| nc ^~~~~~~~~~~~~~~~~cl D/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hevFu:nc670(A:ll60Re:du cenote: _Rfield 'group' will be initialized after field 'stepSize'IN G_ SIMP670 | LE_Sum_f 8 _t2, ncclFuncAllReduce, FuncSumid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' , COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ]/NC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> pr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDownC, COLLL_UN_ROLLS>(tiTd, nEthrePads,S wor/k); s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hi:432z:78: enote: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here o432 | f i(f (tTid <) sub tn) R:unWo rkColsl( ).ru{n(ti d, s ubtn| , wo ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~rk); | ^ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives4,, n c0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RcluFunncWAlolRredukceC, oFulnclSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), roup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670g:roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCC60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(th, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nmmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558threads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, 1, 2, 4>::run' requested here 22 | 1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, Fu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0,ncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPL Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hdIdx.x),:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads( group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] of(T) : stepSiz e670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWork_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0Coll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_2, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5symmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hx.x),:670:15 : gwarning: initializer order does not match the declaration order [-Wreorder-ctor] r670 | o utidp(tid(), ngthrerads(onthrueads)p, ti)dI,nBl ock( thr| eadId ^~~~~~~~~~~~~~~~~x.x) , g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hroup(gro:up),670 | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/nthreads(nthreads), tidInBlock(thsizreofe(T)a : dIdx.x), group(group), | ^~~~~~~~~~~ stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 22 warnings generated when compiling for gfx90a. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_f8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_f8_4, ncclFuncAllReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thr eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group \ | ^ (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from g1, dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: aIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h2:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7,: warning: unused variable 'w' [-Wunused-variable] 75 | f barrierl_by_groaug2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dp(); a | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29t:15: note: expanded from macro 'barrier_by_group' a 29 | 1const int w = t, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uinthreadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_32_t data1S, flaIg1, daZta2, fElag2; ;| ^~~~~ \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t daIn file included from ta1, flag1, d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cppa:2: In file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ha:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:1452:14: warning: ,unused variable 'data1' [-Wunused-variable] 145 | f uintl32_t adata1g, flag21, da;ta2, flag2 ; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1In file included from , flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fla/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2g: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:112: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: ;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5 : warning: unused variable 'w' [-Wunused-variable] 80 | | bar ^~~~~rier _by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from int w = thr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:e2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:a11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:d174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75I:7: warning: dunused variable 'w' [-Wunused-variable] 75 | x . barrixer_by/_grouWp(); A | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hR:29:15: Pnote: expanded from macro 'barrier_by_group' 29_ | SconstI int Zw = tEhread;Idx.x/WARP_SIZE ; In file included from \ \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* In file included from ptr = recvPtr(0)+/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:l2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threa| d ^ In file included from Idx.x/WARP_SIZE; \ | ^ l128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:In file included from 2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelI:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ d - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->chann/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:e2: In file included from l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11L: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ho:175: ;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271 :19: warning: unused variable 'ptr' [-Wunused-variable]| ^~~271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: 80:5: unused variable 'flag2' [-Wunused-variable]warning: unused variable 'w' [-Wunused-variable] 80 | bar rier_by_gr145oup(); | | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29 :15: note: expanded from macro 'barrier_by_group' 29 | c onst in t w = tuhreadIdxi.x/WARnP_SIZE;t \ | ^ 32_t data1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr =/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ imple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduc5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,e, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508::29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]565 506: | t5id(tid:), nt hreadsnote: (nthrein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereads), w 565 | runTreeUpDown, warp(tid/WARP_SIZE COLL _UNRO | LL>( tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_t508 i | d, n thread 671s, wo | rk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hf :lag st432:78: enote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432p | Size(stepSize_ == 0 ? ncclShmem.comm.buf fif (tiSd < suibtn) RunWorkCozll().run(tidfSizes,[NCCL_ PsubtnR,OTO_LLT 1O_SIMw2PLE]/o8NCCL_rS]TEPS/k/size)Nof(T); C: st eCpSiz eL_) {| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | _STEDPS/sizeEof(uinFt64_INE_nt)) c{ | c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group| l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prim Primsitives(, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ plit, nthreads-nthanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRingupR, trOeLe->dLown, >wor(k->setndbuid, nthreadff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | rus, nworkT); r| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.he:432:78e: note: Sin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here p432 | l iif (titd < (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78t:n ) Rnote: unWin instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested hereork Col l(t).riun(dt id,< sub tn,s wourk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | Dbtn) RuEnFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, nWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFuncccl(FunAcAlllRelducRe, eFundcSumu, ucinte32__t, TNCCLR_ALEGO_ERIN_G, LNLCCL1_PR2OTO8_SI_MPLSE, u2) m | ^_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hu:611:623: 2_2, ncclFunnote: expanded from macro 'DEFINE_ncclDevFunc' c 611 | A l RulnWoRrkeduce, FuncSum, uint32_t, NCCL_ALBaGtchO, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:E , Nnote: CCfield 'nthreads' will be initialized after field 'tidInBlock'L_P R OTO_LL167028, | 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h : 611:62t: note: iexpanded from macro 'DEFINE_ncclDevFunc' d(tid), nthreads(611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] imitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nth670 | r tid(tid)e, nthreaads(nthredads), tidIsnBlock(t, work); hreadIdx| .x), gr ^oup(group) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h 671 | : 432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | ste if (tid < subtn) RunWpSoize(strepSize_k == 0 C? nccloShmem.lcomm.bluf().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, N | group(group C/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63C:56: note: Lin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | _ PPrimitiRves, I0, PrMoto, 0P> priLms | E ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:,558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 5584 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < ) | ^s /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611u:62: bnote: expanded from macro 'DEFINE_ncclDevFunc' 611t | nRunWo)rkBat ch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadWsor)kColl,().drun(Itid,dx. sxubtn), wo,rk); | ^ g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12r:1: onote: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here up(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: 12field 'group' will be initialized after field 'stepSize' | DEF INE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Run/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiWorkzBatche,C algoC, proLto, u_nrollP>().ruRn(); O\ | ^ T/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:O15: note: field 'nthreads' will be initialized after field 'tidInBlock'_ 670 | S tiId(tidM), ntPhreaLds(nthEreads]), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCLIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hgroup(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ op, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h tidInBlock(threadIdx.x), g:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] roup(gr670oup | ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads (tid(ntid)t, nthhreards(nethreaadsd), tsidIn)Bloc,k(th readtIdxidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInB18 warnings generated when compiling for host. lock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizet_id, s ubtn=, = 0 ? ncclShmem.cworok); m| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cppm:17:1:. note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here b17 | DEuFINE_fncclDfevFunSc(AlilRedzes[NCCL_PROTOuc_e_TRESE_SIMIPLE_SMum_u3P2_4, LE]/NCCL_STEPS/sizeof(T) : stepSizencclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here , _algo,U protNo, unroRll>()O.run()L; \ L| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h>:670:15: (note: field 'nthreads' will be initialized after field 'tidInBlock' t670 | i tid(dtid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | ), nth readsi(nthrfeads) , tid(InBlotck(thireadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlocd < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_nckc(thlreaDdIdex.xv), Fgrouup(ngrocup)(, | A ^~~~~~~~~~~ llReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(ste/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] pSi 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:r15otoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work);/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] : 670 | 432 tid(:tid), n78thread:s(nthr eads), tidInBlocnote: k(thrin instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NIMPLCE]/CNCCLL_STE_PS/sPizeoRf(T)O : sTtepSOize__) { S| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ I| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hM:303:P90:L note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here E303 | , P2rimi)tives , /note: *Direxpanded from macro 'DEFINE_ncclDevFunc'ect =*/0 , Proto611, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nth rRedeOpa, PdrotosSi)mple,<1, 1, tCOLL_UiNROdLL>,I COnLL_BUNRlOLLo>(tcid,k nt(htreahds,r woerk)a; d| ^ I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432d:78:x note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here. x432 | ) , i f (gtird ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:nthreads(nthreads)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, tidInBlock(thr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreadIdx.x), group(group), | ^~~~~~~~~~~ 17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), ti32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gdrInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n), t| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hh:670:60:r note: field 'group' will be initialized after field 'stepSize' e670 | a tid(dtid),s nthr(eads(nnthretads), htidInrBlocke(threaadIdxd.x), gsroup()group,), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO611 | _ RSunWoIrkBaMtch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/NCCrL_SToEPS/suizeopf(T)) : s,tepSi ze_ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, sub/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthtn, wrork); e | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cppa:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hered 17 | DsEFINE_)ncclDev,Func(A llReducte_TREEi_SIMPLdE_Sum_uI32_4n, ncclBFuncAl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrellReduceo, FunccSumads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ k, uint3(2_thr 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) eadIdx.x), g{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE,t 4) | e^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:p62: note: Size(stepSizexpanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, alOgo, prpoto, u,nroll>( ).run(F); \ a| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:n670:15: note: Afield 'nthreads' will be initialized after field 'tidInBlock' 670 | s tid(ytid),m nthreamds(nthereads),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] t tidInrBlock (670 | tid(tid), nthreads(nthreadithreadcIdxs), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group .x)<, grou1p(grou,p), | NCCL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)_MAX_DEV_ARITY>, /*Direct=*/0, Proto, ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thr0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | eadI dx.x), group( group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tiNCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_2, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tnthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hgroup), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nth/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] reads), t 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threaidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_SdTIdx.x), group(group), | ^~~~~~~~~~~ EPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u32_4, ncclFuncAllReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 22 warnings generated when compiling for gfx90a. [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ Idx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ E; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | coIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ nst int w = th/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ readIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_byIn file included from _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h data2:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ , flaIn file included from g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp::2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174warning: : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14unused variable 'data2' [-Wunused-variable]: warning: unused variable 'data1' [-Wunused-variable] 145 | u int32_t data1145, flag | 1, data 2, flag2 ; uint32 | _ ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:t21 data1, flag1,: warning: unused variable 'flag1' [-Wunused-variable] 145 | uidnata2, flag2; | ^~~~~t32_t d ata1, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hlag1, d:145:35a:ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grIdox.x/WARuP_SIZpE; \ (| ^ In file included from ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x//builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hW:ARP_SI11ZE; \: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptIn file included from r = recvPtr(0)+ll128Offset; | ^~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: fIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:l271:19: warning: aunused variable 'ptr' [-Wunused-variable] 271 | g 1 , uint 64_data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: t*unused variable 'data2' [-Wunused-variable] ptr = re cvPtr(0)+145ll128 | Offse t; | uint32 ^~~ _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ hannelId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h- work:->cha27nnelL:o; | 15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int b ^~~ id = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncIn file included from clShmem.channelId - work->channelLo; | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h = threadIdx.x/WARP_SIZE; \ | ^ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channonset intl bid I= nccdlShme m.cha-nnelId - wwork->cohannrelLo;k | ^~~ ->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const in/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ t bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | Coll ().trun(tid,i subtn,d work);) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp,:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: ncclDevFunc(All/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTOnthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC_SIMPLCE, 2) L | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:_611:62: note: expanded from macro 'DEFINE_ncclDevFunc'P 611 | R RunWorkOBatch, aSlgo, proto, unrolIl>()MPLE]/NCCL_STEPS/.runs(); \ | i ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:z15eof(: Tnote: field 'nthreads' will be initialized after field 'tidInBlock') : stepSize_) { 670 | tid(tid| ), nt ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90:hrea ds(nthnote: readin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested heres), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h 254In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here :670:60:565 note: field 'group' will be initialized after field 'stepSize' | 670 | tid (tid) , nth readrs(nthunTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here dInBrlock(othreatdIdx.ox),S grouip(grmoup),p | ^~~~~~~~~~~ le<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduceIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthre_TREE_SIMPLE_Sum_u64ads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ _2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ duce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(ty>, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatcNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPhL, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Diup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxe.pSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~ rect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | (thr eadId x.x)t, grioup(dgro(upt), i| ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group n) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grlFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h 670 | tid(t:670:i15: warning: initializer order does not match the declaration order [-Wreorder-ctor]d 670 | oup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) t,id(ti d), ntnthreads(nthreads), h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rteads(nithreadds), tIidInBnlock(Bthrleock(thradIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmeadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_2, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group:670:15: (warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tgid(tid), ntrhreads(nthoreads), tiduInBlock(pthreadIdx.x)), group(g,roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | step| Size(step ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSizeSize(_ == 0 ? ncsclShmem.cotemm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : steppSize_S == 0 ?i ncclSzhmem.coemm.buffS_izes[NC)CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here { | 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h | :254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Pr imitive s,DEV_ARITY>, /* /*DDirect=i*/0, Prroto, 0e> pricms | ^t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:=5*/0, Prot: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, o, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work);COLL _UNROL L>(tid,| nthre ^ads, w ork);/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: :in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | 432 :if (ti78d <: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if su btn)( RtunWoirkColdl(g).ruo, Protn(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncco,l COLDL_UNeROLLv>().Frun(utid,n subtcn, w(ork)A; | l ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cppl:17:1R: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested heree du 17ce_TR | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_tEE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:imitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.com m | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hsizeo:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | f(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, tid (tid)0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h, :nthrea565ds(nthre:ads),5 tidInB:lock note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | (threardIdx.xu), gronup(groupT), | r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ e671 | e stepSUize(stpepSizeD_ == o0 ? ncwclShmenm.comm<.buffSTize, RedOp, ProtoSims[NCCpL_PROTO_SIMPLEl]/NCCLe_STEP, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | : 303:90: note: iin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303f | Prim(tid < subtn) RunWorkCollic<().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: 1, in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested hereNCCL _MA X_DEV_ARI17TY>, | /D*Direct=*E/0, FProtoI, 0>N priEms _| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UcNclDeRvFunOc(AlLlReduLce_T>REE_(SIMPtLE_Sum_u64_4, ncclFuncAllReidd,u nthcreades, wo,rk); | ^F /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hu:432ncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWo:r78: knote: Bin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here a432 | t c ifh (_UNR,OLL >()a.ruln(tgid,o su,btn, woprk)r; o| ^ t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:o17:1,: unroll>().run note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | _SI MPL E_Su m_u 64_t4, inccdlFu(ncAtllRiedudce,) Fu,ncSu mn, utinth64_rt, eNCCaL_AdLGOs_TR(EE,n NCtCL_hPRrOTOe_SIMaPLEd, 4s)), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRO/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)TO_S,IMPL E]/N CCL_| STE ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~PS/s izeo f(T)| : s tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_tepS ize _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~671 | group(group | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreadsbuffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: ,in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here wor k); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:254432:78: | note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | i f ( tid < sPubtn) RurnWorikColmlL_UN,ROLL >().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1::15: warning: initializer order does not match the declaration order [-Wreorder-ctor] note: 670 | in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here tid( tid), nt22hrea | ds(nDthreEads),F tiIdInBlNock(Ethre_adIdnx.x)c, grcoup(lgrouDp), e | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/v* DiFr ecut| =*/n tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_0, c Pro( toA, 0>l671 prl | imR s e | ^d /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h u:565s:c5e_RING_SIMPLE_Sum_u64_4, tepSnize(cstepcSize_ == 0 ? ncclShm: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDownCL,CE L]C_/ONLACLLC_GLUO_N_SRRTOIELNPLGS>,/( stNiiCzdCe,Lo _fnP(tRThO)rT eO:a_ dSssIt,Me PwpLoSEri,kz )e4;_) ) | | { ^^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:: 611432 ::| 6278 group(group:: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hnote: note: expanded from macro 'DEFINE_ncclDevFunc'in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here: 254 :90432611 | : | note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254i | Rf u n( Wt oi rd k PB,ym mTae,lt grRoie,cd At(Ro)I, .TCrYOu,nL(L )_;1U >N\,R OL /* L>| D( ^i) r./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hecru:tn670=(:*15/t:0i ,dnote: ,field 'nthreads' will be initialized after field 'tidInBlock'P rs outb670ot | ,n ,0 >wo p rrtikim)ds;( t i| | ^ ^ d /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h,: :17n565::t15:h: note: note: rin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested herein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested heree a ds17565( | | n Dt EhF rI eNrEau_dnnsTc)rc,el eDtUepidDIvonFwBunlnF60u,:n cCnote: AOfield 'group' will be initialized after field 'stepSize'lLL _lR670Ue | Nd R uO c LetL,i> d(Ft(utnicd,Si udnm)t,,h rnueaitdnhstr,6e 4aw_dotsr,(k n)Nt;Ch Cr Lea_| dA ^sL /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hG)O, _:TtRidE432IE:n,78B :lN oCnote: cCin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested herekL (_ PtRhrO432eT | aO d_ IS dI xM .P xiL)fE, , ( gt4ri)od u p<| ( ^gs ru/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hobut:pn611)):, 62 R: u | nnote: ^~~~~~~~~~~Wexpanded from macro 'DEFINE_ncclDevFunc' o rkColl r(edo)p<.tyr>,u anlg(o,t pirodto,, sunurobllt>(n).,r work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_nccluDn(e);v F\ u| ^n /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hc:(670:A15:l lnote: Rfield 'nthreads' will be initialized after field 'tidInBlock' e670d | u c tei_d(TtiRd)E, Enthreads(nthreads), ti_SdIMIPLnE_BSlumo_uc6k(threadIdx.x), group(4g_4r,o nucclpF)un,cA l lR| ed ^~~~~~~~~~~~~~~~~uc e,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h Fu:nc670Su:m,60 u:in t6note: 4_t, NCCfield 'group' will be initialized after field 'stepSize' L _ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 670 | 611 t | id (tid), n th reads(nthreads), tidInBlock(threadIdx.x), group(group), R | ^~~~~~~~~~~ unWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Pr: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ imitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~izes[NC CL_PROTO _SIMPLE]| /NCCL_STE tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_PS/size of(T) : stepSize_) { 671 | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :303:90stepSi: note: zin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | e(stepSize_ == 0 ? ncclShmem.comm. b Priumitivefs, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5:CCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, ProtoSimple<1, 1, 4>, 4>' requested here 565 | r runiTreeUpcDown, /*Direct=*/0, PrOLL>, oCOLLt_UNRoOLL>,(tid , n0threa>ds, workp); r| ^ ims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTree/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hU:432:78p: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereD 432o | wn< T ,if (tidR < seubtnd) RunOWorkpCo, ll().rProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < suun(tbid, tsubtnn, w)ork) ; | ^R /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cppu:17:1n: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here W 17o | DEFrINkE_nccClDevoFuncl(AlllRedu().run(tid, subtn, worm,k uin)t64_;t, N CCL_ ALGO| _TRE ^E, N CC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1:L_PROT/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hO_SIM:P670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | R unote: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here n17 | DWEFINoE_ncrclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611kBa:tc62h, algo, pr611oto, | unr oll> ().r un() ; \ R| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hu:n670:15: Wnote: field 'nthreads' will be initialized after field 'tidInBlock' o670 | r ktid(Btida), ntthrecads(hnthr, algo, proto,hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthrea undroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreaock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78 stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u64_4, ncclFuncAllReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. 22 warnings generated when compiling for gfx90a. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :175/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | cons/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ t int bid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ clShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = nIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ cclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, nc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_2, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_T/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ REE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: go, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, FanSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sum_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_Sum_u8_4, ncclFuncAllReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1102. 22 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75 :7: warning: unused variable 'w' [-Wunused-variable] 75 | bbarrier_by_agrourrier_by_group(p(); )| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15;: note: expanded from macro 'barrier_by_group' 29 | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h | c:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = onst intt w = thhreadIdrx.x/WARP_eadIdx.xS/IZE; \ WA| ^ RP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: In file included from warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:15: note: expanded from macro 'barrier_by_group' 29: | const75:7: warning: unused variable 'w' [-Wunused-variable] int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, fllaag1, dgata2, f1lag2; | , ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145: 28: warning: unused variable 'data2' [-Wunused-variable] d 145 | auint32_tt data1a, flag12, data2, ,flag2; | flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:warning: 145:35: warning: unused variable 'flag2' [-Wunused-variable]unused variable 'data2' [-Wunused-variable] 145 | ui 145 | uint32_nt32_tt data1 , flag1data1, flag1, ,data2, flag2d;ata2, flag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | bar/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpprier_:by_gr2oup(): ; | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | b15: note: aexpanded from macro 'barrier_by_group' 29 | r const int w =r threadIier_by_groupd(x.x/W)ARP;_SIZE; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:\ | ^ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp=:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: ewarning: unused variable 'ptr' [-Wunused-variable] 271c | v P uitnt64_tr* ptr (= recv0Ptr(0))+ll12+ll128Offset; | 8Of ^~~fset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | conIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ st int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShme/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hm.channelId - work->channelLo; | ^~~:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h->channelLo; | ^~~ :366:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ :366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - workIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ lag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h; | ^~~ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTOIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_:565: 5 671 | : stepSi ze(stenote: pSize_ in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here== 0 ? nc clShmem .comm.buffSize565s[NCC | L_PROTO _ rSIMunTrPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, woymmetric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthrkr); e| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cppa:7d:1: snote: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here ,7 | DE FINwE_ncoclDervFuknc(A)llR;educ e_T REE| _SI ^MPLE _Sum/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hP:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | ostDiiv_if32_ 2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBa(tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_tch2, lalgoF, pruoto,n uncrollA>().lrun(l); \R | ^e /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:d670:15:u note: field 'nthreads' will be initialized after field 'tidInBlock' c 670 | e ,tid( FuncStuid),m nthPosreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tDiv , in t3tid(tid), nthread2_t,s NCC(L_ALnGO_TRtEE,h NCCLr_PROeTO_SaIMPLdE, 2s) | )^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nth, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid (tid), nthread s(nthreads), t idInBlock(threa dIdx.x)Primitives 0 ? ,ncclS /*Direct=*/0hmem.co,mm.buff Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: Sizes[NCCL_PROTO_note: SIMPLE]/Nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereCCL_STE 565 | rPS/sizeof(T) : sunTreeUpDown, COLL_UNROLL>(titepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, In file included from wor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:k173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15): warning: initializer order does not match the declaration order [-Wreorder-ctor] ;670 | tid(tid ), nthr| eads(n ^threads ), t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:idIn7Bl:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here oc k(thre7 | DEFINE_nccadIdx.lx), groDup(grevFunc(AllReduce_oTup), R| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ E 671 | E_SIMP L steE_SumPostDiv_i32_2, pSnize(stcepSize_ =c= 0 ? nlFuncAllRcclShmem.comm.buffSizes[NCCL_PReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62O:TO_SIM PLE]note: /NCCLexpanded from macro 'DEFINE_ncclDevFunc'_STEP S/siz 611 | RunWorekof(T)B : staepSizte_) {c | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ h | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h<:c254:90: onote: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | l l Pri,mitiv es, algo,Red Op, FpanAsymrmetrioc().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | L_M AX_DEV tid(tid_ARI)TY, ,1>, /*nDirect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreethreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreadUpsDown), COcLL_UkNROL(L>(ttid, hnthrreades, woark);d | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hI:432:78d: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested herex 432. | x if) (t,id grou < spub(groupt)n) R,unWo | ^~~~~~~~~~~ rkColl, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nth:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, woreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ BluffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCIn file included from CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cppD:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11o: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508w:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] n 506 | , COLL_UNROLL>(tidp(t,id/WARP_ SIZE), n| ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) t 507 | hwarpInBlorck(threaedIdx.ads, workx/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3),); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc gr(oup(grAoup), l | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3l 509 | R steepSized(ncclShumem.cocmm.bufefSizes[_NCCL_PTROTO_LRL128]/ENCCL_STEEPS_S/IsizeofM(uint6P4_t)) L{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ E| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h_SumPostDiv_i32_2,:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &t ncrclFunecAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo,e-> up, pwork-r>senodbuftf, woork,->re cvbuuff, nworkr->roedOlpArg,l 0*>Prot(o::M)axGr.oupWridthu); n| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h(:1070):5: ;note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | \ ru nTr eeSpl| it(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subttid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, nc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWOoLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkBatch, algo, proto, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeunroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | of(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | r kColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, , tidInBlock(threadIdx.x), gCOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, wor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hk); | ^:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cppc:7:1: note: kin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | (DEFINE_tncclDehvFunc(rAllReduece_TREaE_SIdMPLE_SIumPostDdiv_i32x_2, nc.clFunx)cAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreetriac<1,d NCCLs_M), tidInBlock(threadIdx.AX_DExV_A)RITY>,, /* Direcgt=*/roup(gro0, uProto,p 0), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'> prim s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h670 | tid(tid), nthreads(nthreads),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | run:565 :5:t note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested herei d565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested heret hre adId432x.x), group(group), | ^~~~~~~~~~~ | if (tid R(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), n subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl() tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group.run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid),In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthread nthreads(nsthread(s), tindInBlotck(thrh/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdxr.x), egroup(agroupds), wid(t), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ i | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ d%W671 | ARP_SIZE), stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPwLarp(tEid/WAR]P_SIZE)/, | ~~~~~~~~~~~~~~~~~~ N | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) C507 | wCarpInBlLock(th_readIdSx.x/WTARP_SIEZE), | PS/siz ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ eof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h| warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PR:254:90O: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here T254 | O Pri_mitivesL, C/*CL_STEDirect=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here S63 | Pirimitimves, COLL_UNROLL>(t<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, worid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COeLL_UNmROLL>.(tid,c nthreaods, wmork);m | ^ ./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432b:78:u note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here f432 | f ifS (tid i< subztn) ReunWorskColl<[Fn, TN, RCedOp,C AlgL_PROTO_SIMPLE]/NCCL_STEPS/sizo, Peroto,o COLLf_UNRO(LL>()T.run(t)id, s ub: stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | tn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFun group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> pric(AlmlRedusce_TR EE_SI MPLE_| SumPo ^stDiv _i32_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h4, ncclFunc:AllRe565duce,: Fun5cSumP:ostDi v, innote: t32_tin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here, NCC L_AL GO_T565REE, | NCCL_ PROTO runTreeUpDown, CkBatch, algo, proto, unrOLoL_UNROlLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < sublt>(n).run()); \ RunWorkColl().run(tid, subtn | ^ ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 670:15:w note: field 'nthreads' will be initialized after field 'tidInBlock' o 670 | r tkid(t)id), n;thr eads (nth| read ^s), tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cppInBlock(thr:eadI17dx.x:), g1roup(grou:p), note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:17 note: field 'group' will be initialized after field 'stepSize' | 670D | E tidF(tidI), nNthreEa_ncclDevFunc(AllRds(nthreads), tidInBlock(threadIdx.x), educge_TRrEE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDoup(group), | ^~~~~~~~~~~ iv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : ste/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatpSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthread303 | Primitives, /*Direct=*/0, Proto,s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1 nt:hreads (nthreanote: ds), tiin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested heredInBloc k(threa dIdx.x), group17(group) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ D | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ EFINE_n 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hcclDevFun:c(All254R:90:educe_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254a | l Prgimitioves, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here o565, unro | ll>() .run( ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15:r note: field 'nthreads' will be initialized after field 'tidInBlock' u670 | n tid(Ttid), rnthreeUpDown, COLL_UNeads(RnthreaOds), tLidInBlLock(t>hread(Idx.xt), grioup(gdroup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_, | ^~~~~~~~~~~ SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(cclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(sncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' tepS 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Pri/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hmitives, 0, Proto, 0> prims | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ho:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hrou:p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 670 | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :671 | s15tepSiz:e(step Size_ warning: == 0 initializer order does not match the declaration order [-Wreorder-ctor]? nccl Shmem. comm.buffSizes670[NCCL_ | PROTO_ SIMPLE ]/NCCL _STEPS/ sizeotf(T) : sitepSizde_) { (| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hid), :303n:90threads(nthreads), tidInBlock(th: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtnSymmetric<1>, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing().UrunN(tiRd, sOLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: ubtnote: n, win instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested hereork ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17432:1: | note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | D E FINE _nccilf (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_DSiv,I inMt32P_t,L NCECL__ALGSO_TuREEm, NPCCLo_PRsOTOt_SIDMPLiE,v 4_) i| ^ 3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:6112:62:_ note: expanded from macro 'DEFINE_ncclDevFunc'4 ,611 | R unnWorckcBatlch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInB | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlockl(ockt(thhreardIdex.xa), dgroIupd(grxoup.), x | ^~~~~~~~~~~~~~~~~) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60:, note: field 'group' will be initialized after field 'stepSize' g670 | r otidu(tidp), (nthgreards(onthurepads)), ,tid InB loc| k(t ^~~~~~~~~~~~~~~~~hr ead/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIdx.:x),670 gr:oup60(gro:up ), note: | ^~~~~~~~~~~field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | t id, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl]/NCC(L_STE)PS/si.zeof(rT) u: stepnSize_() { t| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | i group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:d303:90: note: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduc Perimit_ives,L /*/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hDiErect=_*/0, :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NPSumPostDiv_i32_4, nrCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | oto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runcclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,T reeU RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ apDown, tCOLL_oUNRO,LL>(t idunroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:, nthnote: reafield 'nthreads' will be initialized after field 'tidInBlock'ds, wo rk); 670| ^ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432: 78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432t | iif (tid < subtn) RunWorkColl().run(d(ttid)i, ndthr,ead s(nsthrueadbs),t tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tn, iwordk);) | , ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp :n17:1t: note: hin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here r17 | DeEFIaNE_dncsclD(evFnunct(AlhlRerducee_TREEa_SIdMPLsE_S)umP,ost Divt_i3i2_4d, nIcclnFunBclock(threadIdx.x), group(group), | ^~~~~~~~~~~ AllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 18 warnings generated when compiling for host. 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncAllReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadId x75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ .x/WARPIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ _SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h::21218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from 1, data2, flag2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: | ^~~~~note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:expanded from macro 'barrier_by_group'35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dat29a1, fla | g1, dat a2, flag2 ; | ^~~~~ con/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] In file included from 27 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h :145:14:c warning: unused variable 'data1' [-Wunused-variable] 145 | o uintn32_t dasta1, fltag1, data2int bid = ncclShmem, fl.ag2; | c ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21h: warning: unused variable 'flag1' [-Wunused-variable] annelId - work->channelLo145; | u int32_t data1,| flag1, ^~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ P_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, workIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, PIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here :670:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_Pwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor] 670 | O tid(tid), nthreTads(nthreOads), ti_dInBlock(SthreadIdIx.x), grMoup(grouPp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671E | step,Size( 2) | ^ stepS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hize_ :611:62:== 0 ? nnote: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorcclShkmem.commB.buffatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDeIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDe 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gvFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid , FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t tiid(tidd), (tid), nthreads(nthreads), tidInBlock(threadIdx.x)nt,hreads(nthreads), tidInBlock(threadIdx.x), group(gr group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads,:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | wo rk);t | ^i /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:d432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid <(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), sub tn) Run| Work ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~Coll stepSize(st().reun(tpid, Ssubtin,ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_ wPork)R; | O ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cppT:7:1O: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here_ 7S | DEFINE_nIcclDMevPFunc(LAllReEduce]_/TRNCCL_STEPS/sizeof(T) : stepSize_) EE{_SIM PLE _SumP| ostD ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iv_i 64_2 , ncclFuncAllReduce, FuncSumPostDiv,| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives int,64_t , NC/CL_A*LGO_DTREEi, NCCrL_PReOTcO_SItMPLE=, 2)* | ^ //builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:6110:62:, note: expanded from macro 'DEFINE_ncclDevFunc' 611 | P RurnWorokBato, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runtTch, aDlgo,o pwroto,n unrT().r,un(); RedOp, ProtoSimple<1 \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Dire/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groctu=*/0,p Prot)o, 0>, prim s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h| :565: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | r| unTre tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_eUpDo wns, COLLt_UNROeLL>padIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCo(tid,S ntize(stepSize_ == 0 ? ncclShmem.comm.ll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(buffSizes[NCCL_PROTO_hreaSds, wIork); M | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hP:432:L78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereE 432 | ] /if (Ntid < subtn) RunWorkCtid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ollCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | ruo().ruwn(tidn, sub, 0, 2, 4>::run' requested here e17 | DEFdINE_ncOclDepvFunc(,AllRe duce_PTREE_rSIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCLotoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, wor_AkLGO_T)REE, ;NCCL_ PROTO _SIMP| LE, 4 ^) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | : RunW432orkBa:tch<78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCocolll, tly, re, algno, proto, unro,ll>(). run()T; \, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, sub | ^ t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670n:15: note: field 'nthreads' will be initialized after field 'tidInBlock', 670 | twid(tiod), rnthrekads(n)thread;s), t idIn | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here Block(threadIdx.x), grou 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4)p(gro up) , | | ^ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h670:60: :note: field 'group' will be initialized after field 'stepSize' 611 670: | 62 t:id( tidnote: ), expanded from macro 'DEFINE_ncclDevFunc'nth rea ds(nt611hre | ads ), tid InB locRk(tuhrenadIWdx.ox),r grokup(Bgroaup)t, c| ^~~~~~~~~~~ h, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ umPostDiv_i64_2, ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(lFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreatid), nthreads(nthreadds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.commIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run()18 warnings generated when compiling for gfx1102. ; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grgroup), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_iCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < 64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : step/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_Size_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncAllReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for host. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx906. 22 warnings generated when compiling for gfx90a. [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: head, mantisswarning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ a; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cppi:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShme: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:m174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:. warning: unused variable 'w' [-Wunused-variable] 75 | c bahrrier_bya_group();n | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:n29:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 15: note: expanded from macro 'barrier_by_group' e 29lId - woIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ rk->channelLo; | ^~~ | const int w = barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.cha/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174n: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: eunused variable 'data1' [-Wunused-variable] 145 | l uint3I2_t datad1, flag1, - work->channelLo dat;a2, flag2 ; | | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: ^~~145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ _t dataIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:21, flag1,: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ onsIn file included from In file included from t int | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ o; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channeIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ lId - work->channelLo; | ^~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from /WARP_S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cppIZE; \ :| ^ 2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ +ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group();:366: 15: warning: unused variable 'bid' [-Wunused-variable] 366 | | con ^~~~~~~~~~~~~~~~~~st int bid =/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h ncclShme:29:15: note: expanded from macro 'barrier_by_group' m.channe29lIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | d c- woronsk->t int w = threadIdx.x/chanWnelLoA; | ^~~ RP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->cchannelLo; | ^~~ hannelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.b | ^u /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hf:670:15: fnote: field 'nthreads' will be initialized after field 'tidInBlock' S670 | i tizd(tide), nthsreads[(nthNreadCs), tiCdInBlLock(t_hreadPIdx.xR), grOoup(gTroup)O, | ^~~~~~~~~~~~~~~~~_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hnthreads(nthreads:670:)15: warning: ,initializer order does not match the declaration order [-Wreorder-ctor] 670 | t itidd(tidI), nnthreaBds(nlthreoads)c, tikdInB(lthreock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | steapdIdSx.x),i grozup(greoup),( | ^~~~~~~~~~~s tepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hroto:670:15: ,warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(0tidIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: ), >nthreads( nthreadsp), tidInBlock(threadIdrx.ims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDownx), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stenote: pin instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ Size_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(ti d== 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou,M PLE]/NsCCL_STuEPS/sibzeof(tT) : stepSize_n) { | , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5p), | ^~~~~~~~~~~ : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTrework); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatcheUpDown, COLL_UNROLL>(tid, nthre, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDet=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | vFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, s ^u /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ btn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(nthreads), tidInBlock(threadIdx.x),In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO== 0 ? ncclShmem.comm.buffSizes[NCCL_P_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group p(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollIn file included from ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[N 611C | C RunLWorkB_atchP, SalgoI, prMoto,P unrLoll>E().r]un()/; \ N | ^ C/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:C670:15:L note: field 'nthreads' will be initialized after field 'tidInBlock'_ 670S | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tiTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:d565), n:thre5ads(:nthr eadsnote: ), tin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested hereid InBl ock(thr565eadI | dx.x ), g roup (gro up),r | u ^~~~~~~~~~~ nTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0TY>, / * ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Dire?ct=*/0, Proto, n0> primsc | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hc:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here l565 | rSunTreeUphDownI, COLLM_UNROLL>P(tid, ntLhreaE]/NCds, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | n) R unWorrkColl().prunD(tid,o subtnw, wornk); <| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7T:1: ,note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t RedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl , alCgo, OprotLo, uLnrol_l>(UNROLL>().run(tid, subtn, work); | ^ ).run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_d2x.x,), gronup(cgrocup)l, F | ^~~~~~~~~~~~~~~~~ u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:n670:60c: note: field 'group' will be initialized after field 'stepSize'AllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMP L670 | E ,tid (ti2d),) n thr ead| s(n^th rea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hds), :tid611I:nB62loc:k(t hrnote: eadIexpanded from macro 'DEFINE_ncclDevFunc'dx.x ), gro611up(g | ro up), | ^~~~~~~~~~~ RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670OLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid) , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(A< subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clDevFunc(Al:l60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (p), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runIn file included from Ring(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFunc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: AIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:l29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] R506 | tid(tid), enthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3)d,uce, FuncSgumPostDiv, int8_t, NCC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hL:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670) | : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tidroup(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> p), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hs(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:S558:/5: note: sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558i | z runeRingo( tid,s ntthreaeds, pworSk); i| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, 1, 2, 2>::run' requested here 12P | DEFrINE_onccltDevFuonc(ASllReiducem_RINpG_SIlMPLea, tCOcLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl,O aLlgLo>, p(roto,) u.nrrolul>n()(.rtuni(d);, \ s| ^ u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hbt:670n:15,: note: field 'nthreads' will be initialized after field 'tidInBlock'w 670ork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), (AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RuntWidoInrBlkocBk(athtrecadhId, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ntdhInBlockr(threadeIdx.x),a group(dgroup), s | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(:670:60: note: nfield 'group' will be initialized after field 'stepSize' 670t | tidh(tid), rnthreades(nthraeads), tidInBlock(threadIdxd.x), grsoup(grou)p), | ^~~~~~~~~~~, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group).comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE, | ^~~~~~~~~~~ PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 unroll>().run(); \ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ti| ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthrreoup(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ anSymmetric<1>, 0, Pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hoto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollds), t(idInBlo)ck.run(ti(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeod, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) f(| T) : ^stepS ize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | : group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h611:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, :62: /note: expanded from macro 'DEFINE_ncclDevFunc' 611 | *Direc t RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ =*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ho:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncAllReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1201. 22 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :366:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hwarning: unused variable 'bid' [-Wunused-variable] 366 | :29:15: note: expanded from macro 'barrier_by_group' 29 | c oconst innt bid = snccltS hmem.channelId - work->channelLo; | ^~~ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173In file included from : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3In file included from 2_t dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = nccl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:h174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:m warning: unused variable 'w' [-Wunused-variable] 75e | bamrrier_by._group()c; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:hannelId - work->channelLo; | ^~~ 29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | cons/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2t: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:i14: warning: unused variable 'data1' [-Wunused-variable] n145 | utint32 _t datab1, flagi1, datad2, fla/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp g:2: In file included from = n2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hc | :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175clSh: mem. ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hc:145:21::h 80:5: warning: annelIdunused variable 'w' [-Wunused-variable]warning: unused variable 'flag1' [-Wunused-variable] 145 - work->c | huint3annelLo; | ^~~2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t 80 | dbarriear_by_grtoup(); a| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h1:, flag1, data2, flag2;29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int bid = ncclShmem.chann/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ elId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ id = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const iIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt w = threa 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ dIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.In file included from x/WARP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:_11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: Iwarning: unused variable 'w' [-Wunused-variable] 80 | Z barrEier_b;y_group (); | ^~~~~~~~~~~~~~~~~~ \/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ c| o ^ nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t dIn file included from ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]In file included from 145 | uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 15In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ : warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15:n warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ t bid = ncclShmem.channelId - workIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ ->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(s(ntnhreadts), thidInBlrock(tehreadaIdx.xd), grsoup(gro)up), , | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_U/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_T:R670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]E 670 | E tid(_tid), SnthreadsI(nthreMads), PtidInBlLock(thrEeadIdx._x), groSup(grouup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ P671 | stoepSize(sstepSizet_ == 0 D? ncclShimem.comvm.buffS_izes[NuCCL_PRO3TO_SIMP2LE]/NCCL_2, nc_STEPS/sizeof(T) : stepSclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PizeR_) { | O ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hTO_:303:90: Snote: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | I PrMimitPiLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' ve611sEV_ARIT,Y>, /*D irect=*/0a,lgo, proto, unroll Pr>oto, 0> (prims )| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h.run():565:5: ; note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:565 | r670un:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | Tr eeUpDo wn), COL,L_UNR OLL>(tgidroup(group),, nt hread s, wo| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tidr(k); t| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hi:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here d 432 | ) i,f nthreads(nthread(stid < )subtn,) RunW orktidInBColll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid ncc().run(tid, sulShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_btn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE,STEPS /sizeoNf(T) C: steCpSize_L) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90:_ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | P RPrimitOives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthtoSirmple, nCOLLt_UhNROLrL>(teid, nathredads,s wor)k); , | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432t:78: inote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here d432 | I nif (Btid lock(threadIdx.x), group(groupIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tidinitializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Pr), nthre< subtn) RunWorkColl().run(tid, subtn, work); | ^ imitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadosup(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h_SumPostDiv_u32_2, ncclFuncAl 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | l Redutce, FiuncSumdPostDi(v, uintt32_t,i NCCL_dALGO_R)IN, nthreG, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchock(th,readId x.x), agroup(lgroup)g,o, proto, u | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671r | steopSize(lstepSilze_ == >0 ? n().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 303 | | Pr ^~~~~~~~~~~imit ives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(:nthreads),670 tidInBlo:ck(threa15dIdx.x),: group(gr oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~warning: | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ initializer order does not match the declaration order [-Wreorder-ctor] 670 | 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROT tOid(tid), _nthreads(nSthreIMPLE]/NCCL_STaEds), tidPInBlock(thSreadIdx.x/), group(sgroupizeof), (| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ T671 | ) :stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si sztepSizee_) { o| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested heref 303 | ( PTrimitiv)es, /*Direct=*/0, Proto, 0> prim{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ s | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | | P ^rimiti ves, ProtoSimple<1, 1, 2>, 2>' requested here 565 | ru FanAsnymmetTric<1, rNCCL_MAeXeUpDown, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown,RedOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl(Otid, LnthreLads, _work);U | ^N /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:R432:78: Onote: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here L432 | L >().run(tid, subtn, work); | if ^(tid < s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cppubtn) RunWorkColl, 0, 2, 2>::run' requested hereL 7_ | DUEFINNE_ncRclDeOvFunLc(AlLlRe>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_nccdulceD_TREeE_SIvMPLEF_SuumPonstDicv_u3(2_2, AnccllFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMlRPeducLe_TREEE_S,IMPLE _Sum2Post)Div_ u32_ 2, n| ccl^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | F uncRAlluRedunceW, oFunrcSukmPoBstDiav, tuinct32h_t,< NCCcL_AoLGOll,_TREE, NCCL_PROTO_ tyS, rIedoMp, alg611o, | pro to, un rol l>R()u.rnun(W); o\ r| ^ k/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:B670atchti,d( tid)a, lngo, proto, unroll>threads(nthreads), tidInBlock(threadIdx()..runx();) \ , | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hg:r670:oup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid),: nt670hre:ads(60: nthrenote: adsfield 'group' will be initialized after field 'stepSize'), tid 670 | tid(tid), nthreaInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tidds(n(thrteadis),d ti)dIn,Blo ck(nthrteadhIdxr.xe), agrodup(sgro(up)n, t| ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWor 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nktColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze(stepSize_ == ), nth0reads( nthreads?), tidI nBlock(nthreadIdcx.x), group(grcoup), | lShmem.comm.buffSizes[N ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:C670:60: note: Cfield 'group' will be initialized after field 'stepSize' 670L | _tid(tiPd), nRthreaOds(ntThreaOds), _tidInSBlock(IthreaMdIdx.Px), gLroup(Egroup]), | /NCCL_STEPS/siz ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), eofg(T) :r stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ o | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hu:303:90p: (note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here g303 | r Porimiutivesp, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc 0 ? ncclShmem.comm.buffSizes[NCCL_PROT(OAllReduce_TREE_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBl: stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, o0, Pto, COLL_UNROLL>(ro)to, 0.> prirms | ^u /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn:558:5: (note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558t | irunRidng(tnid, n,threa ds, wwork);o | ^ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:k78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here ) 432 | ; if ( t| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cppid < s:u17:1btn: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | ) RunWorkColl(_).ruSn(tiId, sMubtnP, woLrk);E | ^ ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp :12:41: note: )in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | D EFIN| E_nc^clDe vFun/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hc(AllReduce_:RING611_SIM:PLE_62SumP:os tDivnote: _u32expanded from macro 'DEFINE_ncclDevFunc'_2, nccl FuncAllR611educ | e , Fu RunWorkBatch, algo, proto, u /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tiodrkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tiInBlock(threadIdx.x), group(grd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), , FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROnote: TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(Sizes[NCCL_PROTO_SIMPtid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | rduop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), ti 611 | RdInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWounWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hrkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Protof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, pc<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565roto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subt nAlgo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: ) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ epSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, F18 warnings generated when compiling for host. uncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ umPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads),: stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDi | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ v_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncAllReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx908. 22 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.chIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channannelId - work->channelLo; | ^~~ elId - work->channelLo; | ^~~ 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from barrier_by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | coinst intn w = thrteadIdx .xbid = ncclShm/WARP_SIZE; \ | ^ em.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? _t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ncclShmem.comm.buffSizes[NCCLIn file included from _PROTO_S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hI:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173M: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]P 670 | L tid(tidE), nthr]eads(nth/reads),N tidInBCCL_STEPlSock(thre/adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stesizepof(T) : stepSize_) { | S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hi:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested herez 254 | Primitives, /*Direct=*/0, Proto, 0> per_ == 0 ?i ncclShmmem.comm.busffSizes[ NCCL_PRO In file included from | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5TO_SI:MPLE]/ note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> dOp, ProtoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:n2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11): In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: Rwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670u | tnid(tid)W, nthroeads(nrthreadks), tCprims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oll, | ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) .671 | run(tid, subtn, st epSiwze(soteprSizek_); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp: ==7 0 ?: ncc1lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62:/s izeonote: f(T) : sexpanded from macro 'DEFINE_ncclDevFunc'tepS ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group611 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h | :254:90 : note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | PRrimuitivneso, /l*Dirlect=,*/ 0, Ptrotoy, 0> prims , | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hre:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllRreades(ntdhreauds), ctidIe_TREEnBl_ock(SItMhrPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuneadIcdx.xS), guroup(mgrouPp), o | ^~~~~~~~~~~~~~~~~ s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670t:60: Dnote: field 'group' will be initialized after field 'stepSize' i670 | v ti,d(ti uint64_t, NCCL_ALGO_TRdE),E n,thr eadNs(nCthrCeadLs),_ tiPdInRBlocOk(tThreOa_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBadtIdxc.In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllx)h, , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(ti29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ d), nthreads(nthreads), tid| warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(gInBlock(threadIdx.x)roup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbuff, work->redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclF, group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ Reduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, nthr,eads(nt hreads/), tid*InBlocDk(threiadIdrecx.xt)=*/0, Prot, grooup(gr,oup 0> prim), s| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565 671 | : st5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | epSize(stepSirze_ =u=nTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here _63 | UPrimitNivROLL>().run(tid, subes,k 0,) Pro;to, 0> p rim| s | ^ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h :558:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here :558 | 7 ru:nRin1g, 0, 2, 2>::run' requested hereo, C OL 7 | DL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here EFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostD432 | i v if, (ti d u< suibtn)n RunWtorkC6oll<4Fn, _T, RtedOp,, Al go, NProtoCCL_A, COLLL_GO_TREUENR, NCCL_PROTO_OSLL>I().MrunP(tiLd,E su,btn, wo2rk)); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12611:1 | : note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DERunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thrnceSumaPosdtDiIv, duixnt6.4_tx, N)CCL,_AL GO_gRINrG, oNCCuL_PpROT(grOo_SuIMPpLE,) 2) , | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62 : note: | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | expanded from macro 'DEFINE_ncclDevFunc' t611 | RiudnWo(rkBtatichr, aelgoa, pdsrot(onthreads), tidInBlock, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t(ithrd(tid), nthreads(nthreeadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(In file included from group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, / | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/s*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: izeonote: f(T) :in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here stepS ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h432:254:90 | : note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primi tives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TRESimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:E,1 NCC:L_PR OTO_note: SIMPin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested hereLE, 2 ) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:627: note: expanded from macro 'DEFINE_ncclDevFunc' | 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | D ^EFIN E_nc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hclDevFunc:(AllR670educ:e_15: note: field 'nthreads' will be initialized after field 'tidInBlock'TREE _SIM PLE_SumP670o | tid(stid), nthreads(nthreads), tidInBlock(threadIdx.x),tDi v_u64g_2, rnccloFuncuAllRepduc(e, FguncSrumPoostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, up), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_Suoll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupmPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | :432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllRe d tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(tIn file included from hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:.2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hx:11: In file included from )/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] g 506 | r otup(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncS | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PRumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RIN/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : G, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().rungroup), | ^~~~~~~~~~~~~~~~~ (); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx .note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNRoup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | OLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: Primitives, /*Direct=*/0, Prot onote: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_U NROLL>(ttid, nthird(tid), nteads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl(R).run(Otid, sTubtn, Owork); _| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cppS:17:1: Inote: MPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17| | DEFIN ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E_ncc lDevFu nc(Al| lReduc group(groupe /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h_TREE_SIMPLE_:SumP30318:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303ostDiv_u64_4, warnings generated when compiling for host. ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown t, COLL_UNROLy,L red>op, atlgoi, prodto, ,unro ll>(n).rutn(); h\ | r ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:15a: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ds, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDi ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ v_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr o 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | Ru nWork Coll< Fn, T, RedsOp, Atlgo, eProtop, COLSL_UNRiOLL>()z.run(etid, (subtns, wor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] tk); | ^e /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17p:1: note: Sin instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 670 | tid(tid), nthreads(nthreads), tidInBlock 17 | iDEFINzE_nccelDevF_unc( =(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_A= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NAllCReducCe_TRELE_SIM_PLE_SSumPoTstDivE_u64_P4, ncSclFun/cAllResduce,i FunczSumPoeofL(GO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T) : stestDiv, uint64_t, NCCL_ApSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runLGOT_TREEr, NCCeL_PROeTO_SIUMPLE,p 4) D| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:611:62w: note: expanded from macro 'DEFINE_ncclDevFunc' n 611 | < RunTWorkB,atch< coll,R ty, eredop, Oalgo,p prot,o, un rollP>().rrun();o \ | ^t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ho:670:15: Snote: field 'nthreads' will be initialized after field 'tidInBlock' i670 | mtid(tpid),l nthreeads(nthreads)<, tid1InBloc,k 1, COLL_UN(threadIdx.x), groupROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCol(lgro().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, workoSimple<1, 1, COLL_UNROLL>, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h == 0 ? ncclS:670:15: warning: hmem.cinitializer order does not match the declaration order [-Wreorder-ctor] om 670 | m if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .b tid(tid), nthreaduffSizess[NCCL_PR(OTO_SIMPnLE]/Nthreads), tidInCCL_STEPS/sizeof(TBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_) : stSepSize_) I{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | M group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63P:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here L63 | PrEimitives<]T, RedOp/, FanSymNmetric<1>C, 0, ProCto, 0> prLi_Sms | T ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | E runPS/sizeof(T)Ring:(tid,90 nth:r note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitiveseads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here , /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:432 | 5 :if (t id < note: subtnin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here) Run WorkC oll(u).runn(tid,T subtn,r work)eeUpDown, 1, 2, 4>::run' requested here ProtoSimple<1, 1, COLL_UNROLL>, C O 22L | DELFIN_E_nUcclNDevRFunOc(ALllRLedu>ce_(RINGt_SiIMPLE_SdumP,os nthreatDidv_us64_,4 work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78:, ncclFuncAllReduce, FuncSu note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollSIM(PLE), .4) r | ^ u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:n611:62(: note: texpanded from macro 'DEFINE_ncclDevFunc' i611 | d ,Run Wosubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPorskBattchD,_ al4go,, pr oton, ucnrocll>l().Frunu();n \ c | ^A /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hl:670l:15:R note: field 'nthreads' will be initialized after field 'tidInBlock'e d670 | u ctide(ti,d), nthreads(nthreads), tFuncSumPoidsIntDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, Bl4ock)(t hre adI| dx.^x), gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.houp(g:rou611p),: | 62 ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h::670:60: note: field 'group' will be initialized after field 'stepSize' note: expanded from macro 'DEFINE_ncclDevFunc'670 | tid(t611id | ), nth rea ds( nthRrunWorkBatecadsh), , algo, proto, unroll>()Idx..x)r, gurounp(g(rou)p),; | ^~~~~~~~~~~ \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncAllReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable]In file included from 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35::218:In file included from warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:27:15: warning: unused variable 'bid' [-Wunused-variable] 27 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:218:15: warning: unused variable 'bid' [-Wunused-variable] 218 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:366:15: warning: unused variable 'bid' [-Wunused-variable] 366 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:421:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 421 | prims(tid, nthreads, tree->down, tree->down, work->sendbuff, work->recvbuff, work->redOpArg); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:461:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 461 | prims(tid, nthreadsSplit, tree->down, &tree->up, work->sendbuff, work->recvbIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: uff, work-/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize'>redOpArg, 0*Proto::MaxGroupWidth); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:503:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 503 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, work->sendbuff, work->recvbuff, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1070:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128, 2>' requested here 1070 | runTreeSplit(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 0, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(AllReduce_TREE_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] FuncSumPostDiv, u i670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nt8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl()/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] .run(tid, 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hloc:670:15:k warning: initializer order does not match the declaration order [-Wreorder-ctor] (670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMthreadIdx.x), group(group), | ^~~~~~~~~~~ PLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_P> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ evFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1:e ads(ntnote: hreadin instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested heres), t idInB lock(threadI7dx.x | ), grDoup(gEroup),F | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ I | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ N671 | E step_Size(nstepcSize_c == 0l ? ncDclShmeevFum.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeonfc(Al(lReducTe_TRE)E_SIM PLE_S:umPost Div_us8_2,t nccelFuncpAllRSieduce, FuncSumPostDiv, uint8_t, NCCL_ALzGe_)O {_ T| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group R/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254E:90: note: Ein instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | , NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatc h r, /*Dedop, algoirec,t=*/0 , Propto, r0> prioms | t ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ho, unrol:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown(,).run (); \P | ^ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15o: note: field 'nthreads' will be initialized after field 'tidInBlock' t 670 | o tiSimple<1, 1, COLL_UNROLL>,d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670 COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:1062:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 1062 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:10:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 10 | DEFINE_ncclDevFunc(AllReduce_RING_LL128_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h== 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, Func.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(step, 0>S prims | i ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:z5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565e | runT_reeUpDow n, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >? ncclShmem.co, COmLL_UNRmOLL>(ti.buffSizes[NCCL_d, nthrePads, worROTO_SIMPLE]/NCCL_STEPS/sik); | ^z /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78eof(T): note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclD/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unr:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreadoll>s().ru)n();, \ | ^ tidInBlock(threadIdx.x),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15g: note: field 'nthreads' will be initialized after field 'tidInBlock' r 670 | o tiud(tidp), nt(hreadsg(nthrreadso)up), ,tid | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stIneBlockpSize(stepSize_ ==(thr eadId0x.x), grou?p(gro up), n| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hc:670:60:c note: field 'group' will be initialized after field 'stepSize' lShmem 670 | . com tidm.buffSizes[N(Ctid),C nthrLeads(_PROTO_SIMPLE]/NCCL_STEPS/nthresads),i tidInzBlocke(threoadIdxf/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(T) : ste.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tpSizei_) { d | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ I| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hn:63:56:B note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here l63 | o Pricmitiveks, 0,x P), groto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(TIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] _SIMPL 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ L_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupe)ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreadty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 2>, 2>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hp), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hroup:670:pSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | PrimitivesthreadM,Idx.x),P groupL/(groupE*), | D ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ir]/eN tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ cCCL_ST671 | E sPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254te:pSize(90stepSiz:e_ == 0 ? nccnote: lShmem.in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested herecomm.bu ffSize 254 | s[NCC L_PRPrimitives prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ etric, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5 :{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.hnote: :in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | 565Primi | tives , COLL_UNROLL>ITY>(, /*Ditrect=i*/0, dProto,, 0> p rims n | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.ht:565:5h: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here r 565eads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: | runote: nTrein instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested hereeUpD own< T, RedOp432, Pr | otoS impl e<1 if , 1(, COLtL_iUNROLL>,d < subtn) RunWorkColl,(t Proto, COid,L nthLread_s, wUork)N; R| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hO:432:L78: note: Lin instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here >(432) | .run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc( A lif l(tidR < seubtnd) RuunWorckColle()L.runE_SumPostDiv_u8_4, ncclFuncAll(tiRd, seubtnd, wourk);c | ^ e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:,17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here F17 | DuEFIncSumPostDiv, uNE_ncclDevFunc(AllReduint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: ce_TRexpanded from macro 'DEFINE_ncclDevFunc'EE_S IM PLE_Sum611Post | Div_ u8_4, ncc lFun cAllRReduuce, nFuWncSoumPorstDikv, uBint8a_tch, ta, NlCCLg_ALGOo_TRE,E, NCCL_pPROTrO_SIoMPLEto, unroll, >4) (| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h):611.run(); \ | :62: ^note: expanded from macro 'DEFINE_ncclDevFunc' 611/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrRuenWorakBatdch, arlgoe, pads), tidInBloroto, unroll>().ruck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670n:(); 60\ | : ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:note: 670:15:field 'group' will be initialized after field 'stepSize' note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(t670id), | nth read s(n thre ads), ttid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grouipdIn(Blockg(thrreoadIdxu.x),p g), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl()./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hhreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 t | id(ti d), n threa ds(nt htid(tireadds),) tid, nthreads(nthreads), tidInInBBlockl(threoadIdcx.xk), g(roupt(grohup)readIdx.x), gro, | u ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) Ru/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proOtp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P18 warnings generated when compiling for host. ROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 12 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(ntnote: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown | if (tid < subtn) RunWorkColl(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().rProto, COLL_UNROLL>un(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBl().run(tid, subtn,ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv,, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrea uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:254:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 254 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:303:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 303 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:565:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1, 4>, 4>' requested here 565 | runTreeUpDown, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 0, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:17:1: note: in instantiation of member function 'RunWorkBatch, 0, 2, 4>::run' requested here 17 | DEFINE_ncclDevFunc(AllReduce_TREE_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_TREE, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:63:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 63 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/all_reduce.h:558:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 558 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp:22:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 22 | DEFINE_ncclDevFunc(AllReduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncAllReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx1101. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1200. 18 warnings generated when compiling for gfx1201. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx942. 22 warnings generated when compiling for gfx90a. [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | coIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const ip(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: ; \ | ^ unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_byIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: _group(expanded from macro 'barrier_by_group'); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :29:15: note: expanded from macro 'barrier_by_group' 29 | cons29t int w | = thread Idx.x/WAR P_SIZE; \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] const int w = threadIdx.x/WAR 80| ^ | barIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Ofrier_by_group(P)_SIZE; \;f | ^ set; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t*/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10p: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.ht:271:19: warning: unused variable 'ptr' [-Wunused-variable]r 271 | ui=nt64_t* ptr r= recvPetr(0)+lcl128Ofvfset; | P ^~~ tr(0)+ll128In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; ; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem. | c ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:o78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here m432 | mif (tid. < subtn)b RunWorkCuoll().reun(tid, ssubtn, w[ork); | ^N /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1:C note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DECFINE_ncclLD_PROTO_SIMPLE]/NCCL_STEPS/sizeofevFun(c(AllToAlTlPivot_R)ING_SIMPL E_Su: sm_i8tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto,_ 2, ncclF0uncAllToA>llPivot, FuncSum,p int8_t, NrCCL_ALGiO_RING, mNCCL_PROTsO_SIMPLE , 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing, COLL_UNROLL>(tid, nThreads, work); algo| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | , proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLnEthrea_ds(nthSreadsu), mtidInB_lock(ithrea8dIdx._x), g4roup(,group) , | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670n:60: note: cfield 'group' will be initialized after field 'stepSize' 670c | tlid(tiFd), nuthreads(ntnhreads)c, tidAllToAllPivot, FuInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_2, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:37:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 37 | Primitives, 0, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/alltoall_pivot.h:82:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 82 | runRing(tid, nThreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/alltoall_pivot_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(AllToAllPivot_RING_SIMPLE_Sum_i8_4, ncclFuncAllToAllPivot, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18 warnings generated when compiling for gfx908. 18 warnings generated when compiling for gfx1102. 18 warnings generated when compiling for gfx906. 18 warnings generated when compiling for gfx1100. 18 warnings generated when compiling for gfx1030. 18 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ E; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ int32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t datIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | 18 warnings generated when compiling for gfx1201. const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from barrier_by_gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const SiIZE;n \ | t ^ In file included from w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bunused variable 'flag1' [-Wunused-variable] a 145 | r uint32_t drata1i, flaeg1, darta2, _flag2b; | ^~~~~y /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145_:28: warning: unused variable 'data2' [-Wunused-variable]g r145 | o uint3u2_t dpata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: 1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: ,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:f warning: unused variable 'data1' [-Wunused-variable] l145 | a uignt132_t d,ata1, flagd1, daata2, tflag2;a | ^~~~~ 2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:,21: warning: unused variable 'flag1' [-Wunused-variable] 145 | f ulint32_at datga1, f2lag1,; da ta2, flag2| ; | ^~~~~ ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 :28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint();3 | ^~~~~~~~~~~~~~~~~~ 2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:_15: note: expanded from macro 'barrier_by_group't 29 | codnst iant w t= thraeadId1x.x/WA,RP_SI ZE; \f lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35| ^ : warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 271 | u int 64_ t* pt r = re cvP tr(0u)+liln128tOff6set4; | _ ^~~ t* ptr = recvPtr(0)+llIn file included from 128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from arrier_b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^y_g rouIn file included from p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cppc:2: hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10a: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:n145:14n: warning: eunused variable 'data1' [-Wunused-variable] l145 | L ouin;t32 _t dat| a1, ^~~ fl ag1, data/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp2, :fla2g2;: | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h::14519:21:: warning: unused variable 'flag1' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 15: warning: unused variable 'bid' [-Wunused-variable] 19 | 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] const int bid = ncclShmem.channelId - work->channelLo; | ^~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, daconst int ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h 271 | : 29 : ui15nt6:4_t * pnote: tr expanded from macro 'barrier_by_group'= r ecv Ptr(029)+l | l12 8Of fset ; const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15:18 warnings generated when compiling for gfx1101. warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ hannelLo; | ^~~ 18 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OfIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ fset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:19:15: warning: unused variable 'bid' [-Wunused-variable] 19 | const int bid = ncclShmem.channelId - work->channelLo; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tIn file included from id, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl_(SI)MP.LEr]/uNCnCL(_StTEiPSd/s,iz eosf(ubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchr, ,0, waorlk-g>oco,nn Inpderx,o wtorok-,>connI nduexn);r o| ^l /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.hl>:97(:5): note: .in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here r u97 | n ( )ru;nRi ng\(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670 :15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | i ft id((ttidi), dnt hr().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RrIoNG_SupI)M,P L E| _ ^~~~~~~~~~~S um_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereprev, &ring->next, inputBuf, outputBuf/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, :work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(BroadcasIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreat_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(dts), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:111:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 111 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Broadcast_RING_LL128_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:10: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_2, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hadIdx.x), group(g:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | rtoup),i | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ d | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ ( 671 | t sitepSidze(ste)pSize,_ == 0 ? nncclShtmem.cohmm.burffSizees[NCaCL_dPROTO_sSIMPL(E]/NnCCL_StTEPS/hsizeofr(T) :e stepaSized_)s), tidIn { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NneCxt, CinputBLuf, o_utputPBuf, wRork->rOedOpATrg, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h0, work->:connI60nd/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ex,: work7->con:nInde x); note: | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.hin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here:97:5 : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | ru60nRing | prims(tid, nthreads,(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum,< subtn) RunWorkColl, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, Func:670S:15:u warning: initializer order does not match the declaration order [-Wreorder-ctor]m ,670 | i tind(ttid)8, n_thrtea,ds( nthNreaCds)C, tLidI_nBlock(tAhreLadIGdxO.x)_, gRrouIp(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bufNG, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ fSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:60:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 60 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/broadcast.h:97:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 97 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/broadcast_sum_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Broadcast_RING_SIMPLE_Sum_i8_4, ncclFuncBroadcast, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for host. 22 warnings generated when compiling for gfx90a. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantIn file included from issa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/device_table.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1200. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1201. 1 warning generated when compiling for gfx942. 12 warnings generated when compiling for gfx908. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp 12 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx90a. [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/host_table.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx1201. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx942. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1200. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: In file included from unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from uint32_t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12y: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:,18: warning: unused variable 'y' [-Wunused-variable] 77 | uhint32_t y,e head, maantissa; d | ^ , mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const iIn file included from nt /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: wIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h= t:13: hIn file included from r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.he:a173dIdx.x/WARP_SI: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hZ:75:7: warning: unused variable 'w' [-Wunused-variable] E75 | ; \ bar| r ^ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;eadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x//builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174dat: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaa2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ dIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] w = thre145adIdx | .x/WA RP_SIZ E; \ | ^ In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1i: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:t145:14: warning: unused variable 'data1' [-Wunused-variable] 3 145 | 2 uint3_2_t datta1, fl ag1, ddataat2,a1, flag f1lag2; , | data2, ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: 145:f21l:ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3 warning: unused variable 'flag1' [-Wunused-variable] 2 145 | _ uint3t2_t dat a1, fdlag1a, datat2,In file included from a1, fla flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | g1, data2, flag2; | ^~~~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), s), tidI| nBlock( ~~~~~~~~~~~~~~~~~~thread | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZIdx.x)E, grou)p(grou,p), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:60:| note: field 'group' will be initialized after field 'stepSize' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~670 | tid(tid), nthr eads(n thread| s), warp(tid/WARP_SIZEtidInB loc k(threadIdx508.x), g | roup(g roup), | ^~~~~~~~~~~ flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:11: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here : 3 | MSCCL_IMPL_In file included from KERNEL_E/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hNTRY_F:UNC_DEVR13EDOP_T: YPE(MinIn file included from Max, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hhip_bfl:oat16, 173false): ; | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | ms:ccl670R:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | unIn terpret er< type, Func##devredop, ProtoLL128, fti, work); \ | ^ d(tidu), nthrleads(nlthreads)O, tipdIsnBloc>k(threa(cdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSomim, algzo, woerk(); s\ t| ^ epSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15toSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thr: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ roup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: In file included from warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, fl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ =In file included from recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fulL_CHUNKSTEPS/MSCCL_SLICESTEPS, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ lOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:flag1,7 data2:, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hwarning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threa \ | d ^ IIn file included from dx.x/WARP_SIZE;In file included from \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h: 13:: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:| 1175: ^: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80: 5:In file included from warning: unused variable 'w' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h 80 | : 13 barrier_b: y_gIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ roIn file included from up(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::29:15: note: expanded from macro 'barrier_by_group'174 In file included from : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, da 29 | t const aint w 2= thr,eadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13d: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.ha:145:14: warning: tunused variable 'data1' [-Wunused-variable] 145 | a ui1nt32_t ,data1, flag1, fdatla2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:a145:21: gwarning: unused variable 'flag1' [-Wunused-variable] 1451 | ui,nt32_t data1d, flaga1, datta2, flag2; | a ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:1452:28: warning: unused variable 'data2' [-Wunused-variable] , 145 | f uint32_t data1, flag1, data2, flag2; lag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from la/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WA tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509RP_SIZE | ), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3: 1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERN384EL_ENTRY | _FUNC_DE VREDOP_TY PE(MinMmax, halfs, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); cclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNCIn file included from _DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), Sizes[NCCL_PROTO_SItidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>:508(:29: cwarning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] o506 | m m ,t algo, id(tiwd), notrhkre); \ads ( n| t ^h /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | reads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | ti d 507 | ( watrpInBilock(dthrea)dId,x.x /WAnRP_tSIZhE),r | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e | a warp(tid/WARP_SIZE 508ds(nthr | e falads), tidInBlgoThreacd((tikd%4)=(=threadIdx.x), group(group), | ^~~~~~~~~~~ 3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, fa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:lse); | ^508 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3:: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 38729 | mscclR:unIn terpretwarning: er, aProtoSimple, fullOps>(comm, walgo, warorpkInBlock(t); \h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | readIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | tid (tid), nthreads (nthre ads), tsidInBlotck(threeadIdx.x)p, grouSp(grouip), z| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:e60:(ncclShmem note: field 'group' will be initialized after field 'stepSize' 670. | tcomm.buffSizes[NCCL_PROTO_LL128]/NCCL_Sid(Ttid), nEthreads(PnthreadSs), /tidInBslock(ithrzeof(uieadnIdx.x),t 6g4r_otu)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hp(group), | ^~~~~~~~~~~ :199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmeIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_byIn file included from _group()/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ unused variable 'w' [-Wunused-variable] 75 | In file included from barrier_by/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp 11 warnings generated when compiling for gfx90a. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, ad2, flag2; | ^~~~~ ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group()/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: uint32_t dataunused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uinIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | t32_t data1, flag1, data2, flag2; | ^~~~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_I | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ >, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h tid(tid):508,:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | n tid(titd), hreads(nthrnthereads(nathrds),e ads)tidIn, Bwid(loctikd%(threadIdx.xWARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threa), grodup(grouIp), d| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670x:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), .x| /WARP_S ^~~~~~~~~~~IZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ S, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from | const int w = threadIdx.x/WARP_SIZE; \ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp ^ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group()In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75In file included from | barrier_by_groupIn file included from (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 7: In file included from warning: unused variable 'w' [-Wunused-variable] 75 | barrier_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:f271:19: warning: unused variable 'ptr' [-Wunused-variable]l 271 | a ugint64_t1* ptr = ,recvP tr(0)+ldl128Offaset; | t ^~~ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h2:_t data1,75 flag1, d:ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h7:145:21: warning: unused variable 'flag1' [-Wunused-variable] : 145 | ui nt32_t dawarning: ta1, flagunused variable 'w' [-Wunused-variable]1, data 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | 75 uin | t32_t dat a1, flag1 , data 2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cobarrier_by | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:_35:gnst int w = threadIdx.x/WARP_SIZE; \ | ^ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h warning: unused variable 'flag2' [-Wunused-variable] 145 | uin:t329:15: 2_note: t daexpanded from macro 'barrier_by_group' ta1, flag1, da 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128O/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:f1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:fset; | ^~~ 13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h 3 | MSCCL_:508I:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] M PL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPEIn file included from (MinMax506 | ti,d(tid), nthreards(nthrceads), cwid(tid%lWARP_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp_float8, false); :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]SIZ E), wa rp(tid/WARP_SIZE),670 | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx. | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, worncclSkhmem.com)m.buff;Sizes[N CCL_PRO\TO_SIM sPtepS izeL(ncc| lSEhmem ^.]com m.b/uf/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hfSNizes[NCCCL_PROTCO_:L670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670L_ | STEPS/ sizeofL( 128]/ TNCCL_S )TEPStid(tid), : stepSize_) {/sizeof (uint64_ t)| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primiti nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 670 | ves< T, Re dOp,tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), FanAgsyrmmetoric, 1(, Prgoto,r 0> oprimsu | ^ p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:)3:1: ,note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | M SCCL| _IMP ^~~~~~~~~~~L_K ERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h::508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThrea670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1,: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: warning: | initializer order does not match the declaration order [-Wreorder-ctor] 670 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ti d(tid ), nth| reads( tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_nthread s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prif(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rcmcs | l ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp_:f3:1: lnote: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here o3 | MaSCCLt_IMP8L_KER,NEL _ENTfRY_FaUNC_lDEVsREDOeP_TY)PE(M;inM ax, rccl_float8, fa| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hl:387:s3: note: eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' )387 | ; ms | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'ccl Run Interpr387eter | mscclRunInterpreterunc#,#devr edopP, oProttoSimople, fullOps>(comm, algo, CCwL_Sork); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid),LICESTEPS, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g ntrhreaods(nuthreapds),( tidgInBlrock(othreuadIdpx.x)), gr,ou p(g rou| p), ^~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:60: note: field 'group' will be initialized after field 'stepSize' :670 | 670 t:id(t60id),: nth rnote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreadeads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE( == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | PrimitivesM,inMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Mi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ nMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_11 warnings generated when compiling for gfx1030. by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ int w = threadIdx.xIn file included from /WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: atIn file included from a2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h, fla:g2; | 175 ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: :145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:28:: warning: unused variable 'data2' [-Wunused-variable]271 :145 | 19 ui:nt3 2_twarning: daunused variable 'ptr' [-Wunused-variable]ta1 , f lag1,271 da | ta2 , f lag2 ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 271In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx90a. [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 1111 warnings generated when compiling for gfx1100. warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp28:1: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hwarning: :unused variable 'data2' [-Wunused-variable] 13: 145In file included from | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 173 : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hu:75in:7t: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grou3p2_t( da)ta;1, flag 1, | dat ^~~~~~~~~~~~~~~~~~a2, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hlag2; :| ^~~~~29 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h::14515::35 :note: expanded from macro 'barrier_by_group'warning: unused variable 'flag2' [-Wunused-variable] 29145 | | c uionnst int w = threadIdx.x/WARP_SIZt3E2_t; d ata\1, f lag 1,| ^da ta2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | In file included from ^~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, CHUNKSTEPS/MSCCL_SLICESTEPS, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLLMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple(comm, algo, work); \ | ^ SCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreadsIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from 77 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: int32_t y, head, mantissa; | ^ warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:In file included from 13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h175:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:: 7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h75 | barrier_by:_group();80 | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::29:15: note: expanded from macro 'barrier_by_group' 529 | : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_grou p const in(t w = thre)adIdx.x/W;ARP_SIZE | ^~~~~~~~~~~~~~~~~~ ; \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :29:15: note: | ^ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from 145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, fIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ alse); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:112: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.hIn file included from :14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h18: warning: unused variable 'y' [-Wunused-variable] : 77 | uint3142_t y: , head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: :77In file included from :18: warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:1477 | : uint/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h32_t y, h:ead, man77tissa; : | ^ 18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :29:15: note: expanded from macro 'barrier_by_group'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h 29: | 173 : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] const75 int w = | th breadIdaxrrier_by_group.x/WAR(P_SIZE; ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: \ note: | ^ expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrie/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174_: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75by:7_: warning: unused variable 'w' [-Wunused-variable] g75 | brarriero_by_grouup(); | ^~~~~~~~~~~~~~~~~~ p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); (expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hb:29:15: note: expanded from macro 'barrier_by_group'a r29 | cornst int wi = threaedIdx.x/WrARP_SIZE_; \ | ^ In file included from by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] = thread145Idx.x/ | WARP_S IZE; \ | u ^int32_t d at:13: In file included from a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:1174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145,:14: warning: unused variable 'data1' [-Wunused-variable] 145 | f uinlt32_t adagta11, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | u, fliag1, dnata2,t flag23; | ^~~~~2 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:_21: warning: unused variable 'flag1' [-Wunused-variable] t 145 | uindta32t_at1, d fatlaa1g1, data2, flag, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 uint32_t data1, flag1, data2, flag22; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, ;f | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hl:145:a35:g warning: unused variable 'flag2' [-Wunused-variable] 1451 | ,uint32 _t datad1, flaag1, dtata2, aflag2;2 | ^~~~~ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WIn file included from ARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h::13: In file included from 13/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:: warning: unused variable 'w' [-Wunused-variable] In file included from 80 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h b:ar175rier_: by_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hgroup:(); 271 | ^~~~~~~~~~~~~~~~~~ :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:1915: note: expanded from macro 'barrier_by_group': 29 | warning: counused variable 'ptr' [-Wunused-variable]nst in t w = th271readIdx.x | /WAR P_SI ZE; \ | ^ uint64_t* In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1p: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ tIn file included from r = r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hecvPtr(0):+ll117528Offset; : | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ r(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508dx.x/:WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | 29 fla:gThread(( tid%4)=warning: =3), grofield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]up(group ), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 506 | 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/siztid(tied), nothreads(fnthreads(), wuid(tid%WiARP_SIZnE), warpt(tid6/WARP_SI4ZE), _ t)| ~~~~~~~~~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group):199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here, 199 | Prim itives<| T, RedOp ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~, FanA symmetri c<1,1>, | 1, Pro warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3to, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1509: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMP | L s_tepKSize(nEcclShRmNeEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false);m.co mm.bu ffSiz| es[NCC^L_PRO TO_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hLL128:]/NC384CL_ST:EPS/s3i:zeof (uintnote: 64_expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE't)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> pr384 | msciclRunImnterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(tIn file included from hreadIdx.x/WARP_SI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cppZ:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13E: ),In file included from | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h| warp(tid/WARP_SIZE 508 | flagThrea:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | d tid(ti(d), n(titd%4)==3), ghreadrs(nthreaodsup)(group, tidIn), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCLBl_ock(thPreadIdx.Rx), groOup(groTup), O| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ _671 | sLtepLS128]/NCCizLe(stepSizIn file included from _STE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cppPS/sizeo:ef_ == 10 ? ncc(: lSIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13uinthm6em.co4m_t)) m.buffSizes{ [N C| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group CL_PROT/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested hereO_SIMPLE ]/NCCL_ 199 | STEP S/ Primsiizeof(tiveT)s : : , 1, Protno, c0> prcims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cppl:3:S1h: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here m 3 | MSeCCL_mI.MPL_KEc Rnote: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here N199oE | mPrimimtLive._sbE, 1ML, Praoto,x 0>, prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cppu:3:1: note: iin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3n | MSCCtL_IMPL3_K2E_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: RNEL_ENTRY_FUNC_DEexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'VREDO P_T YPE(Mi_STn384EPSM | /si azeox f(T),m : s scculinttR3epSu2i_ze_)t { ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n | group(groupI nftale/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hr:199:p57s: rnote: ein instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here e 199t | Per)ri;1, Pr>otoL,L128 , fu1 lmslcclROu, Proto, 0> prims p | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here s >(comm, algo3, wo | rk);M \ S| ^ CCL_InInteMrprPeterL,EN ProtoSiTmpRle, fullOps_>DE(VREDcOP_oTYPE(MinMmax,m ui,n algo, t32w_t,ork); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreterid(,tid ),P ntrhreoadst(ntohreadSs),i timdInpBlolck(ethrea, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:384:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1111 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx942. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: : In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77::18:14 warning: unused variable 'y' [-Wunused-variable] : 77 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h : ui77nt32_t: y, 18head:, man tissawarning: ; | ^unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, heaIn file included from d, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:In file included from 13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:75:1:7: warning: unused variable 'w' [-Wunused-variable] : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h75:173 | : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h : barrie75:7: warning: runused variable 'w' [-Wunused-variable] _b75 | y_group(); barrier| _/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80 ^~~~~~~~~~~~~~~~~~b:5: warning: y unused variable 'w' [-Wunused-variable] 80/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h_ | g barrrie:r_byo_gro29up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hu:29:15: pnote: expanded from macro 'barrier_by_group' 29( | c)onst i;nt w = t hreadIdx. x/WAR| P_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:In file included from 1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp32_t:1 : dIn file included from a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:t175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271d | a uint64_tt* ptr =a recvPtr(10)+ll128Off,se flagt; | ^~~ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, In file included from In file included from flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h::13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h1:145:14: warning: : unused variable 'data1' [-Wunused-variable] 145 | In file included from uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h2:_13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OIn file included from ffset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:: warning: unused variable 'ptr' [-Wunused-variable] 271 | 15 uint64_t:* ptr = recvPtrnote: (expanded from macro 'barrier_by_group' 0)+ll1 28Offset29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: Idnote: x.x/WARPexpanded from macro 'barrier_by_group'_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \In file included from | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, daIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:t174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:a warning: unused variable 'w' [-Wunused-variable] 752 | , barri er_byf_group(); l | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:ag15: note: expanded from macro 'barrier_by_group' 2 29 | co;nst int w = thre | ad ^~~~~ I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: dx.unused variable 'flag2' [-Wunused-variable]x/WARP_SIZ E 145; | \ | ^ In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_warning: unused variable 'w' [-Wunused-variable] 80 | barrby_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = reIn file included from cvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271: 19: warning: unused variable 'ptr' [-Wunused-variable] 271 | ui nt64_t*u ptr = reicvPtr(n0t)32_t data1, flag1, +ll128Offsdet; | ^~~ ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506In file included from | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1s: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13(: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre508ads(nthreads), tidInB | lock(threa dIdx.x), group(g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | f stepSize(stlagThread((tid%4epSize_) == 0 ? nc=clShmem.=co3m),m .group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | buff Sizes[NCCLs_PROTO_SIMtPLE]/NCCLe_SpTEPS/sizeof(T) :Size(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t stepSiz)e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ) | group(group { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hrimitivesnote: , 1in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | , P roto,Primitives primes | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cppt:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here r3 | MSCCLic<1,1>_,IMPL_KERN EL1_,E Proto, 0NTRY_>FUNC_ prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cppDEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | tric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LICESTEPS, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: In file included from field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670: 15| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthread warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | fl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | PriagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitimitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3ves, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL, Red_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hOp,:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterprete Fanr, ProtoLL128, fullOps>(comm, algo, work); \ | ^ Asymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC 671 | stepSize(stCLepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ of(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groudata1,p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flagIn file included from 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+llg1,128Offset; | ^~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cppZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ , | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE:508):29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] ,506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cppds(nthreads), tidIn:Block(th1readId: x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173 tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:stepSiz warning: initializer order does not match the declaration order [-Wreorder-ctor] e(st 670 | epSiz e_ = = 0t i? ncclIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(unInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threMianMax, uintd8_t, false)I; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hd:x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:387670:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' :387 | mscclR60unInterpIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ re:ter< tnote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype, Func##devredop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Pridx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_MinMax_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(MinMax, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3211 warnings generated when compiling for gfx1201. _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable]271 | uint64_t* ptr 145= r | e c v P turi(nt32_t data0)+1ll128,Offse flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.ht; | ^~~ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | threa dIdx. x/W ARP _SIZEb; \ arr | i ^ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ * ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13 | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28:: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cppo:n1s: tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hi:n13t : wIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h=: 175t: h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hre:a80d:Id5x:. x/warning: WAunused variable 'w' [-Wunused-variable]R P_SIZE; \ | 80 ^ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h::271:119: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13 warning: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | unused variable 'ptr' [-Wunused-variable] 271 | ba uirnt64_rt* piter_byr = recvPtr(0)+ll128Offset; | ^~~ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: L_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint6t4_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:hreads)199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here , tidInBlo 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SI:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ZE), warp(ti11 warnings generated when compiling for gfx90a. d/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:1: In file included from (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13g: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:r173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:o warning: initializer order does not match the declaration order [-Wreorder-ctor] u670 | p )tid,(tid) , n thre| ads(n ^~~~~~~~~~~~~~~~~th r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670ad:s)60:, note: tfield 'group' will be initialized after field 'stepSize'i d I670n | B lo ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? n tid(tid), nthreads(nthrceads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ lag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/W tid(tiAd), nthRreads(nthreads), Pwid(tid%W_ARP_SIZES), warIp(tid/WARZP_SIZE), E | ~~~~~~~~~~~~~~~~~~), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | w | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) a507 | warprInBplInBlock(ock(threatdIhdrxe.axd/IdWx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ARP_S IZE), | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE warp(tid/WARP_SIZE508 | flagThread((tid%4)==3), group(group), | 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128 ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | ] warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | / stepSizNe(ncclShmemC.comm.buffCSizes[NCCLL_PROTO_L_L128]/NCCSL_STEPS/siTzeof(uinEt64_t)) { P| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S/ s| i group(groupz eof(uint64_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57:t note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ?/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hnterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] ) 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WAR==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ P_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:671 | stepSize(stepSize_ == 0 ?1 ncclShme: m.comm.bIn file included from uffSizes[/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hNCCL:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h_PROTO:_SIM670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] PLE]/NCCL_ 670 | STEPS tid/s(izeoftid)(T) , nthrea: stdepSizes(nthreads), ti_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, ProtLo, 0> prims _ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSSCCL_IMPTL_KERNELE_ENTRY_PFUNC_DEVRESDOP/sizeof_TYPE(P(rodT,) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group rccl_bfloa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.ht8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Pro#ddevredop, Prot oSimple, fulloOps>(coamm, ta8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hlgo, work);: \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 387670 | | tid(tid), nthreads(nthreads), tmscclRuinInterdpreIter, ProtoSimplethrea,dIdx.x) , groupf(group),u | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hl:670:60l: note: field 'group' will be initialized after field 'stepSize' O670 | p tid(stid), n>thre(ads(cntohmm, readsalgo, work); \ ), tidIn| Block(t ^hreadId x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h.x:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670), | group (group), tid | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | P tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreaddIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: s), tidInBlock(thfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize 506 | (stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : steIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1030. 1111 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:group();173 | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =In file included from threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, halIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx90a. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ata1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:In file included from 13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE),In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o 11 warnings generated when compiling for host. /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14:: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from int w /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1=: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 75:7: warning: unused variable 'w' [-Wunused-variable] threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hI:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: Z/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]E 75 | ; barrie r_b\ | ^ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ barr ier_by| ^_ In file included from group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145145:14: warning: | unused variable 'data1' [-Wunused-variable] 145 | uint3 2_t dat a1, fulagint32_t data1, f1,l data2a, flagg2; | 1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: ^~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h,:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: 145:21: warning: funused variable 'flag1' [-Wunused-variable] 145 | lag1, d uinat32_tt dataa1,2 flag1,, dat a2, fflag2;l | ^~~~~ a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28g: warning: 145:214:unused variable 'data2' [-Wunused-variable] warning: ;unused variable 'data1' [-Wunused-variable] 145 | | 145uint | 32_ ^~~~~t d ata1, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hf lag1, ud:atai2145,n :flatg282; 3: | ^~~~~2 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h_:warning: 145:21:tunused variable 'data2' [-Wunused-variable] warning: unused variable 'flag1' [-Wunused-variable] 145 | d a uint145t32_t | adata 11, fl ag1,, da t af2u, flliag2a; n | g ^~~~~ t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h1:1453:28:, 2warning: unused variable 'data2' [-Wunused-variable] _145 | d t aui nt3t2d_t aadata1t, f2la,ag1, data1f,l2, flag1ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable]:145: 35: warning: unused variable 'flag2' [-Wunused-variable] 145 | 145 uint3 | 2_ut :145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ daita1,n flt32_t dataag11,, data 2, fflag2l; | ag1, data2, flag2; | ^~~~~ ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 11 warnings generated when compiling for gfx1201. uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ EDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ [NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e>, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid506( | t tiid(dtid%), WnthAreads(RnthPre_adsS), IwidZ(Etid)%WA,RP_ SIZwE),a warrp(tpid/(WARtP_SiIZEd), / | ~~~~~~~~~~~~~~~~~~W | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)A R507 | P w_arpSInBIlocZk(tEhre)adI,dx.x /WA RP_| SIZ ~~~~~~~~~~~~~~~~~~E), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | warp(tid/WARP_SIZE stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 507 | war509pIn | Blo c k( thre adsIdx.tx/WeARPp_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | S warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3ize (nc clShm509em. | com m.b uff Siz es[NsCCLt_PReOTOp_LL1S28i]/NzCCL_eSTE(PS/nsizecof(cuinlt64S_t)h)mem .{ c| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ o | mm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | group(group ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h :199 :57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3| | group(groupM /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hS:C199:57C: note: Lin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here _199 | I PrMimiPtivLes<_KT, ERedROp,N FaEnL_ENAsymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, woCCL_IMPL_KrEk); \ | ^ RNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), wa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for host. rp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:t data1, flag1, dat1: aIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:2175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5,: warning: unused variable 'w' [-Wunused-variable] flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175In file included from : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13:5: warning: unused variable 'w' [-Wunused-variable] 80: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+lldIdx.x/WARP_SIZE; \ | ^ 128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dat:21: warning: a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threaIn file included from dIdx.x/WAR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cppP:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 1111 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads)In file included from , wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: sIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hepSize(ncclShmem.com:670m:15: warning: .initializer order does not match the declaration order [-Wreorder-ctor] b670 | u tidf(tid)f, nthSreadsi(nthrzeads),e tidIsnBloc[k(thNreadIdCx.Cx), gLroup(_groupP), | R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_O 671TO_LL128]/NCCL_STEPS/sizeo | f stepS(ize(sutepSiize_ == 0 n? nctclShme6m.com4m.buf_fSt)) { izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rcclof(_T) : fsteplSizeo_) {a | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t | group(group 8/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:,199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primiftivea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ lse); | ^ s , 1,note: Prexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'oto, 0 > prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCC384L_IM | PL_K ERNE L_ENTmRY_FUNC_DEVREDOP_TYPE(Prod, rccl_flscoclRat8, false); unI nterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hr:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | PrimitiCL_IMPL_KERNEL_ENTRY_FUves, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ NC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ devredop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIIn file included from d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx90a. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head,77 | muint32_t ay, nheatd, mantissia; | ^ ssa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();In file included from | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13 ^~~~~~~~~~~~~~~~~~: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 75:7:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZ:29:15: note: expanded from macro 'barrier_by_group'E 29 | c;onst int w = thr\eadIdx.x /WARP_S IZE; \ | | ^ ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from onst int w = thread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:I174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7d: warning: unused variable 'w' [-Wunused-variable] x75 | .barrierx_by_gro/up(); W| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:A15: note: expanded from macro 'barrier_by_group' R29 | cPo_SIZE; \ | ^ In file included from nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h2:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:;174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145: 14: warning: unused variable 'data1' [-Wunused-variable] 145 | | uin ^~~~~t32_t d ata1, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hflag1, data2,:In file included from 145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp::1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:2113: In file included from f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hl: 145:14:a warning: warning: unused variable 'data1' [-Wunused-variable] gunused variable 'flag1' [-Wunused-variable]145 | 2 uin;t32_t data1 , f| lag1, ^~~~~d145at | a2, ui/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:n21: flatwarning: g2; 3unused variable 'flag1' [-Wunused-variable]| 2_t 145 ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h | :d 145ata1, flag1, data2, uifnt32_t datal1, flaag1:21:, warning: unused variable 'flag1' [-Wunused-variable] 145 | g duint23a2_;tt da | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dIn file included from ata/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp1:a2, 1f,lag2: ; fta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h145:75:7:: warning: unused variable 'w' [-Wunused-variable] 75 | bar r | ^~~~~ i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145e:28: rwarning: unused variable 'data2' [-Wunused-variable] _145 | b uyint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable]35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; _gr| oup ^~~~~(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 8080: | 5: warning: unused variable 'w' [-Wunused-variable] 80 | bar rier _by_bgrouap()rrier_by_group(); | ; ^~~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :29:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h15: note: expanded from macro 'barrier_by_group' :29 | 29 :15: note: expanded from macro 'barrier_by_group' 29 | const intc w o= thnreast int w = thdIdx.x/WARP_SIZE; \ | ^ readIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hr:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.he:271:19c: warning: unused variable 'ptr' [-Wunused-variable]v P271 | t In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: r ui(nt640_t* )ptr += relcvPtlr(0)1+ll1228Of8fsetO; | f ^~~ fset;175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80: 5:unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | warning: unused variable 'w' [-Wunused-variable] ^~~80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ +ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hid/WA:508:29R: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506P | _SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZEtid()tid), n,threads (nthrea ds), wi| d(tid%W ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ARP_SIZ E), war p(tid/| WARP_S warp(tid/WARP_SIZEIZE) 508 | flag, | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)T 507 | h read((tid%4)==3) w,arpInBl ock(thrgeadroup(gIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThroup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives509 | s,tepSize (ncclShm1em.comm,.buffSiz es[NCCPL_PROTO_rLL128]/NCoCL_STEPtS/sizeoof(uint64,_t)) { 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1 | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives:,508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 1506 | , tid (tid)P, nthrreadso(nthrteads)o, wid,(tid%WA: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereR 3 | PMSCCL__IMPLS_KERNIEL_ENZTRY_FEUNC_D)EVREDO,P_T 0Y> prwiPms a E| ^ rp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/W(AProd,R int32P_t, f_alse)S; | ^ I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:Z3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' E 384 | ) mscc,lRunIn terp reter| , ProtoLL128, fullOps>(comm, algo, work); \ | ^ in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(In file included from nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from s), ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15InBlock(threadIdx.: warning: initializer order does not match the declaration order [-Wreorder-ctor]x 670 | tid(ti), group(group), d), | nthread ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s(nthre ads | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ), ti dInB stepSilock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmemze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | PrimitivesSTEPS/s,izeof(T ) : ste1pSize_), { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hP:199:57: note: rin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here o 199 | to, 0> Prim itpirims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: ves, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENnote: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, ProtoSimple, full2Ops>>(c,omm, algfo, uwork)l; \ l | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hO:670:p15: note: sfield 'nthreads' will be initialized after field 'tidInBlock' >670 | ( comm, algo,tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), wo rk); \ | | ^ ^~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 670:15:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h note: field 'nthreads' will be initialized after field 'tidInBlock' 670: | 670 tid:(tid60), n:thr eadsnote: (nthfield 'group' will be initialized after field 'stepSize'read s), tidInBlo670ck(threadI | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterp 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreadcomm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 11 warnings generated when compiling for gfx1102. 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 1111 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h: 13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75c:7: warning: unused variable 'w' [-Wunused-variable] 75 | o barrnier_by_grousp(); | ^~~~~~~~~~~~~~~~~~t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | in const itnt w = th w = threadIdx.x/WARP_SIZE; \ | ^ readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const i:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flagag11, data,2, flag2 ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hd:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145a | uitnt32_t adata1, 2flag1, d,ata2, f lflag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_ag2; t| ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 :28: warning: unused variable 'data2' [-Wunused-variable] d 145 | auint32_tt data1a, fl1, flag1, data2,ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uinIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | t 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271unused variable 'ptr' [-Wunused-variable]:19: 271 | warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = r7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ecvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: onst int wunused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1 75 | : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174 : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, dunused variable 'data1' [-Wunused-variable] 29145 | | uint3 2_t data 1, fla const int w = threadIdg1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, dax.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174ta: 2, fl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hag2; :| ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145145:28: warning: :unused variable 'data2' [-Wunused-variable] 145 | 14 uin:t32_t data1,warning: flag1unused variable 'data1' [-Wunused-variable], data 2, fl ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145145:35: warning: | unused variable 'flag2' [-Wunused-variable] 145 | uin t32_t data1, flaug1, daita2, fnlt32_t data1, flag1, data2, flag2; ag2; | ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_warning: unused variable 'ptr' [-Wunused-variable] b271 | y _ uintg64_t* rptr =o recvPutr(0)+lpl128Of(fset; ) | ^~~ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:529:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \t hreadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ( group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:t57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199i | PrimidtivesR, 1, ProtPo, 0> p:_508:29: warning: rfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]S 506 | i Itidm(tidZ), nsthreEads (nth)rea ds), ,w| id(tid% ^WARP_SI Z E), wa| ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:.3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here x3 | MSCCL/_IMPL_KERWNEL_ENTRAY_FUNCrpR(_tid/WARPPD_SIZE)_,E | ~~~~~~~~~~~~~~~~~~ SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThVRErDOP_TYePE(Parod, idnt64_t(, false() ;| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) t 507 | i wardp| InBl%o^ck(t4 hread)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIdx.=x/WARP_=:SIZE)3387, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ): | warp(tid/WARP_SIZE ,3508 | : fla ggThrenote: roup(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterad((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hp:reter<199ty: stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, sProtoySimpmleSLIC,ESTE PS, 21>, f,ullO ps>(cPomm,r algoo, wtorko); \, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h0:670:15:> note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | p rtid(itid),m nthsread s(nt hrea| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCLds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro_uIMPLp_KER)NEL_,ENTR Y_FU NC_D| EVR ^~~~~~~~~~~EDOP_ TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ic<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KE:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIRdNEL_ENxTRY_FUN.C_DExVREDOP/_TYPE(WProd, Aint64_Rt, falPse); _| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387S:3: note: Iexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | Z mscclERunInt)erpret,er, hProtoSrimpleea, fullOps>( ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Pcomm,r algo,i work);m \ | ^i /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670t:15: note: field 'nthreads' will be initialized after field 'tidInBlock' i 670 | v tid(etid), snthread, 1, Proto| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h,:670:60: note: field 'group' will be initialized after field 'stepSize' 0670 | >tid(ti d), ntphrims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: rnote: eads(in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested herenthrea ds), t idInBlock(threadIdx.x), group(group)3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn, false); Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscc,l | ^~~~~~~~~~~RunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h::387670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: ag2;In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:145:28: warning: :unused variable 'data2' [-Wunused-variable] 145 | 13 uin: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | t32_t d ata1, f lag1, datab2, flaga2; | ^~~~~ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: runused variable 'flag2' [-Wunused-variable] 145 | ier_by_gr ouint32_up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29t :data1, 15flag1, : note: expanded from macro 'barrier_by_group' 29 | const int wdata2, flag2; | ^~~~~ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: nt32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const iunused variable 'data1' [-Wunused-variable]n 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hnote: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8:508:29: warning: _field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | t tid(,tid), n threadfs(nthreaads), wlid(tid%WsARP_SIZeE), warp)(t; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpid/WArRP_SIZEe), | ~~~~~~~~~~~~~~~~~~ t| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | e warpIrnBlock(t, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); 3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter< | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groutype, Func##devredop, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670p:), | ^~~~~~~~~~~ 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL12In file included from 8, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1 | : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h :13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h :145:14 : warning: unused variable 'data1' [-Wunused-variable]c 145o | nuinst32_tt dat a1, iflagn1, dtata2 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: w = threadIdx.x/WARP_SIZE; \ | ^ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag22,; f la g2| ; ^~~~~ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145::35145:: warning: unused variable 'flag2' [-Wunused-variable]35 :145 | warning: unused variable 'flag2' [-Wunused-variable] u in t32145_t | d at a1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | baIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WIn file included from A/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ RP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* In file included from ptr = /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx90a. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primit/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, woIn file included from rk); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group)In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nth, | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitiveseads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35:In file included from warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] : 50629: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthrea | d tid(tids), nthrea(ds(nthreands), threadwid(tids)%W, wid(tid%WARP_SIZAE), wRP_SIZarpE(tid/), warp(tid/WWARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | wARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3),arpInBl ock(threagdroup(groIdx.x/WuARP_SIZE)p, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ) warp(tid/WARP_SIZE 508 | , flagThrea d((tid%4 )| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3==3), g roup(gro up509 | stepSize(ncclShmem.comm.buffSize),s | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ [ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 N 509 | C steCpSize(LncclSh_mem.cPROTO_LL128]omm./buffSizNes[NCCCL_PROTCO_LL12L8]/NCCL__STEPSS/sizeofT(EPS/suint64_izt)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.heof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto:199:57,: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Pr0imit>ives, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uetiric<1n,1>, t1, 6Proto, 40> p_rims t| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp,:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here f3 | MSCaCL_IMlPL_KEsRNEL_EeNTR)Y_FUNC;_DEV REDO P_TYP| E(Pro^d, u int64/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h_t, false):; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384384:3: :note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 3384 | mscclRunInterpr: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, aetelr, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIn file included from IMPLE]/NCCL_STEPS/sizeof(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tT) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hY_FUNC_DEVREid(Dtid), ntOhrea:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ds(Pnthreads_), tidInBTlock(threYadIdx.Px), groupE(group),( | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ P| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | r stepSioze(stepSdize_ == 0, ? ncclS hmem.comum.buffSiizes[NCCLnt64_t, false);_P ROTO_SIM PLE]/N| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387CCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple<:M199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SCCL_CHUNKSTEPS/MSCCL_SLICESTEPS, MSCCL_SLICESTEPS, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, fal11 warnings generated when compiling for host. se); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx90a. [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11In file included from warnings generated when compiling for gfx1200. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mIn file included from antissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_b: warning: unused variable 'ptr' [-Wunused-variable] 271y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:ll128Offset; | ^~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, dataIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPt | const int w = r(0)+ll128Offset; | ^~~ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cons ctonst inti w =n thrteadI dx.x/wWARP _SIZ=E; \ | ^t In file included from hreadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | wtidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ arpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ (tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S, 2>, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTOE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(gro_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPup), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), g11 warnings generated when compiling for host. roup(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Prod_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h::13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: 14/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]: 75 | bawarning: rrier_byunused variable 'data1' [-Wunused-variable]_group() ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :29145 | :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data11,, dat a2, flfag2; l | ^~~~~ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: In file included from unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h::145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uin13: tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: 3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:2 warning: unused variable 'w' [-Wunused-variable] _80 | t bar riedar_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; co nst i nt w | = thr ^~~~~eadId x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ vPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrt* ptr = recvPtr(0)+ll128Offset; | ^~~ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreterIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagTh/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ read((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, fal/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | s e) tid(tid), nthreads(nth; r | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.he:387:3: note: aexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' ds), wid387 | mscclRunIn(terpretterA, ProtRoSimpleP_SIZE), f ~~~~~~~~~~~~~~~~~~ullOp s>(c | omm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | field 'nthreads' will be initialized after field 'tidInBlock' w arpInB lock(threadId670x.x/WA | RP tid(_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROtid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), n11 warnings generated when compiling for host. threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.In file included from c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ omm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, hip_bfloat16, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:11 warnings generated when compiling for gfx1200. 1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from IZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flIn file included from ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cppint6:4_t*1 pt: r =In file included from re/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hcvPt:r(013)+ll1: 28OIn file included from ffset/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h; : 174| ^~~ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hIn file included from :13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp::1: In file included from 80/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h5:271::19: warning: unused variable 'ptr' [-Wunused-variable] warning: unused variable 'w' [-Wunused-variable]271 | uin80t64_ | t* ptr = re c v bPatrrr(i0e)r+_ll128Offset; | ^~~ by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ nInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL:508:29_: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] EN T506 | R tYid(_tidF),U nthNreadCs(nt_hreadDs), Ewid(Vtid%RWARPE_SIZED),O warPp(ti_d/WARTP_SIYZE),P | ~~~~~~~~~~~~~~~~~~E | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) ( 507S | uwarpmInBl,ock( threradIdcxc.lx_b/WfAlRoPa_tS8I,Z Ef)a,ls e );| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ warp(tid/WARP_SIZE /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h :387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 508 | flagThread((tid%4)==3), group387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, (garoupl), go | , ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509w | o rstkepSiz)e(n;c c\lShm em. com| m ^.bu ff/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSizes:[NC670CL_P:ROTO15_LL1: 28note: ]/NCfield 'nthreads' will be initialized after field 'tidInBlock'CL_S TE PS/sizeof(670u | int 64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t | i group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hd:199(:57:t note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested herei 199d | ) P,rimi tivnets, 1,threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: ==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: InBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)note: field 'group' will be initialized after field 'stepSize' )670 | ti{d(ti d) ,| nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_bf8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:729 | const int w = threadIdx.x/WARP_SIZE; : warning: unused variable 'w' [-Wunused-variable] \75 | b arrier_by_g roup(); | | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: ^ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hP:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:_ warning: unused variable 'w' [-Wunused-variable] 75 | S bIarrier_Zby_groupE(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h;:29:15: note: expanded from macro 'barrier_by_group' 29 | \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | ui/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hn:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.ht:145:14: warning: unused variable 'data1' [-Wunused-variable] 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | 145 | uint3 2_t dauta1, fliag1, ndata2, tflag23; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h2:145:_21: warning: unused variable 'flag1' [-Wunused-variable] t 145 | uint3d2_t daata1, ftlag1, daata2, 1flag2,; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145flag1, data2, flag2; | ^~~~~ :28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp::1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:17513: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hexpanded from macro 'barrier_by_group' 29 | c:onst i29nt :w = t15hreadId:x.x/W ARP_SInote: ZE; \ expanded from macro 'barrier_by_group'| ^ 29 | const int w = thr: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ead ^Idx.x /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:17519: warning: unused variable 'ptr' [-Wunused-variable] : 271 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h u:int271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 6 4_t * ptr = rec vPtr(0)+ll128Offset; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:g1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:o19: warning: unused variable 'ptr' [-Wunused-variable] 271u | puint64_(t* ptr =) recvPtr(;0)+ll128O ffset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = re warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19cvPtr(0)+ll128Offset; | ^~~ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, f: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ lag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, fa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ lse); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ nAsymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, In file included from 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, alg/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSio, work); \ | ^ zes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f16.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, In file included from mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_g uint32_t roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIIn file included from dx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cppb:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hy:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:_174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | g barrirer_by_grooup(); u | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29p:15: note: (expanded from macro 'barrier_by_group' 29 | ) const; int w = th | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:rea15dIdx.:x/ note: expanded from macro 'barrier_by_group' 29 | cWARP_SIZoE; \ nst in | ^ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.ha:13: In file included from r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: iunused variable 'data1' [-Wunused-variable] 145 | ueint32_rt data1, fl_bya_ggroup(); | 1, ^~~~~~~~~~~~~~~~~~ data 2,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:In file included from 13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h, :w508o:r29k:) ; warning: \field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] | ^ 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1200. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantiIn file included from /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp ssa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ hreadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ roup(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) {/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP(th_readIdxS.x), grIoup(grZoup), | E ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:)60: note: field 'group' will be initialized after field 'stepSize' ,670 | tid(ti d), nt| hreads( ~~~~~~~~~~~~~~~~~~nthread s), ti dInBlo| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | ck(t hreadIdwarpInBlxo.x), gcrk(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508o | up(gro up), | ^~~~~~~~~~~ flagThread((tid%4)==3), group(groupIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:s508:29: >warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] (506 | c tid(otid),m nthremads(n,threa ds), awid(ltid%WgARP_SoIZE), ,warp(t id/WAwRP_SIoZErk); \ | ) ^, | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | In file included from warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:t1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13): In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173): /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670: 15:{ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | | tid ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~(tid) , | group(groupnthreads(nthreads), tidIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h_:P199:57:R note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested hereO 199 | T PrOimiti_ves, 1, P]roto,/ 0> pNrims C | ^ C/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:L1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here_ 3 | SMSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_f.lx), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oat8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hTEPS/sizeo:f(T) 384: ste:pSize3_) { : | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hnote: :199:expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primit384ives | < mIn file included from T, RedOp, FanAsymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCLsc_clRunIInterIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: MpreterP, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##deKvredoEp, NPEL_ENTRY_FUroNtoLL1C28, f_ullOpDs>(coEmm, alVgo, woRrk); E\ | ^ DOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(com m| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ ,671 | staepSilze(sgtepSoize_, == 0 ? nwccloShmerm.cokmm); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | . buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~) : s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.htepSi:ze_670) {: | 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ :| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:note: 199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested herefield 'group' will be initialized after field 'stepSize' 199 | Prim670iti | ve s, ,1, Progto,r 0>o pruipms ( | ^ g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:r3:1o: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested hereu p3 | M)SCC,L_IM PL_ KE| RNE ^~~~~~~~~~~L_E NTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: ), groupwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_f8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_float8, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_b:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable]y 145 | u_groiup(); n | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:t29:15: note: 3expanded from macro 'barrier_by_group' 292 | co_nst intt w = t hreadIddx.x/WAaRP_SItZE; \ a| ^ 1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h: 13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | c barorier_byn_group(s); | ^~~~~~~~~~~~~~~~~~ t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | i connst int tw = thr eadIdx.w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ CL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hd(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ type>, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx110211 warnings generated when compiling for gfx908. . 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp 11 warnings generated when compiling for gfx90a. [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrierIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ _by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13:: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid( | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpphreads), wid(tid%WARP_SIZE), warp(ti:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here d199/ | W A RPPr_iSmIiZtEi)v,e s <| T ~~~~~~~~~~~~~~~~~~, R| e stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)d Op, FanA s507y | m m e t rwiacrl,o c1k,( tPhrroetaod,I d0x>. xp/rWiAmRsP _ S| I ^Z E), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp | : warp(tid/WARP_SIZE3 :1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 508 | 3f | lMaSgCTChLr_eIaMdP(L(_tKiEdR%N4E)L=_=E3N)T,RY _gFrUoNuCp_(DgErVoRuEpD)O,P _ T| Y ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~P E (| S warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3u m, int8 _509t | , f a lsstee)p;Si z e| (^n cclShm/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.he:m387.:c3o:m mnote: .expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'b uffSizes[N C387C | L_ P RmsOcTcOl_RLuLn1I2n8t]e/rNpCrCeLt_eSrT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, P| r group(group otoSimple, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested hereI CESTEP S,199 | M SPCrCiLm_SiLtiIvCeEsSp,, fFualnlAOspysm>m(ectormimc,< 1a,l1g>o,, 1w,o rPkr);o to\, 0| ^> prims | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h ^: 670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: 670 | in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here ti d3( | tMiSdC)C,L _nItMhPrLe_aKdEsR(NnEtLh_rEeNaTdRsY)_,F UtNiCd_IDnEBVlRoEcDkO(Pt_hTrYePaEd(ISduxm.,x )i,n tg8r_otu,p (fgarlosuep));, | | ^ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h670::38460::3 :note: field 'group' will be initialized after field 'stepSize'note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 670 | 384 | tmisdc(ctlidR)un,I nnttehrrperaedtse(rnI, dPxr.oxt)o,L gLr12o8u,p (fgurloluOpp)s,> ( c| ^~~~~~~~~~~o mm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_i8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14In file included from : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gro_t data1,up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIunused variable 'flag2' [-Wunused-variable] 145 | uint32In file included from _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:575:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const i: warning: unused variable 'w' [-Wunused-variable] nt w = threadIdx.x/WARP_SIZE; \ | ^ 80 | barrier_by_groupZE;( \ | ^) ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h145:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:147: warning: :unused variable 'w' [-Wunused-variable] 75 | warning: barrunused variable 'data1' [-Wunused-variable]ier_by _group() ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 15:145 note: expanded from macro 'barrier_by_group' | 29 | const int w = thre uint32_t data1, flag1, dataad2Idx.x/W,ARP_ SIZE; f\ | ^ In file included from lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hconst int w = threadIdx.x/WARP_SIZE; \ | ^ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:a1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:131: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h,:145:14 : warning: unused variable 'data1' [-Wunused-variable] f145 | luint32a_t datga1, fla1g1, d,ata2, flag2;d | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.ha:145:21: warning: unused variable 'flag1' [-Wunused-variable] t145 | uiant32_t2 data1,, flaIn file included from flag2; | ^~~~~ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dataIn file included from 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: 28Of warning: unused variable 'flag2' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | ui 145 | nt64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' fset; | 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag180:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \int 32_t d ata1, f| lag1, d ^ata2, f lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:1335: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZtid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | PrimitivesO_LL128<]/NCCL_TSTEPS/s,izeof(u int64_t)R) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hd:199:57: Onote: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t,p, FanAsymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreterunInte,rprete r, oProtoLLL128, fLullOps1>(comm, algo, work); \ | ^ 28, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE)In file included from , warp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hp:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173): /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:, warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid| ), nthr ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~eads(n threads ), tidInB| lock(th tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_readIdx .x), g roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSh(mtid/WAReP_SIZEm), | ~~~~~~~~~~~~~~~~~~ . | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | c waorpInBlomck(thremadIdx.x./WARP_SbIZE), | u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE f508 | fflagThreSad((tiid%4)==3)z, groupe(group)s, | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~[NCCL_PROTO_SIMPLE]/NCCL_STEPS | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prim/siszeof( T) : stepS| ize_) ^{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVRED/sizeOof(uPint64__t)) {T | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Y | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hP:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here E 199 | Primitiv(es, 1 , Pr:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVuinREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t32_to,to, 0> pfrimsa | ^ l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:s3:1:e note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here ) 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tinote: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterd), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ preter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, tid(tid), nthreads(nthreads), t1i, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] CCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ pe>, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid )| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u32.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: 173warning: unused variable 'data1' [-Wunused-variable] 145 | : uint32_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ht data1, f:lag1, da75ta2, flag:2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h7:145:21: warning: unused variable 'flag1' [-Wunused-variable] : warning: unused variable 'w' [-Wunused-variable] 75 | 145 | buint32_at data1,r flag1, drata2, fliag2; | ^~~~~e /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: rwarning: unused variable 'data2' [-Wunused-variable] 145 | _ uint32b_t data1y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ <1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warid(tidp), nthr(eads(nthrteads), tidiInBlock(tdhreadIdx.x/), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u64.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx90a. [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ , flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp : 1c: oIn file included from n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.hst: 13i: nIn file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :w175 : = /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.ht:h271r:e19a:d Iwarning: dunused variable 'ptr' [-Wunused-variable]x .x/WARP_SIZE; 271\ | | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509L_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, FanAsymmetric<1,1>, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:384:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 384 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:13: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:199:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 199 | Primitives, 1, Proto, 0> prims | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/msccl_kernel_Sum_u8.cpp:3:1: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2, 2>, false>' requested here 3 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/msccl_kernel_impl.h:387:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 387 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn)OLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShm:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIn file included from InBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:2r: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: oIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, up(group), | ^~~~~~~~~~~ work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduce, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hIn file included from :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28:In file included from warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 11 warnings generated when compiling for host. group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads),), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncR/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) educe, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduce, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const int w = threadId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175.: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:In file included from 2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sen, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), dntbhurfefa,d sw(onrtkh-r>eraedcsv)b,u ftfi,d IwnoBrlko-c>kr(etdhOrpeAardgI,d x0.,x )w,o rgkr-o>ucpo(ngnrIonudpe)x,, w| o ^~~~~~~~~~~~~~~~~r k->c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ho:n670nIn:d60e:x )note: ;field 'group' will be initialized after field 'stepSize' | ^ 670 | tid(t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hi:d63):,5 :n tnote: hin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested herer eads(nt h63r | e a d s )r,u ntRiidnIgnu(pt)i,d , | n ^~~~~~~~~~~t hreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpps(:n2t: hIn file included from re/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.ha:d11s: )In file included from ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :t173i: d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn:B670l:o15c:k (warning: thinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x), group(gro u670p | ) , | ^~~~~~~~~~~t id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_2, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f16_4, ncclFuncReduce, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ x.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174271 | : uint64_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.ht* ptr = :recvPtr(075)+ll128Offset; | ^~~ :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:7514: warning: unused variable 'data1' [-Wunused-variable] 145 | | uint32 _ t d barrierata1, flag1_,b yd_group(); | ^~~~~~~~~~~~~~~~~~ ata2, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hflag2; | : ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h29:15: note: expanded from macro 'barrier_by_group' 29 | const:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | i uint32_ntt w = data1t,h reflaadg1,Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp :data2, f2lag2; : | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from :28: warning: unused variable 'data2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uIn file included from int3In file included from 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h::174145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = th/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19In file included from : warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreadsIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Red/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncuce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float,clDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff,In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: wowarning: rk->reinitializer order does not match the declaration order [-Wreorder-ctor]dOpArg, 0, work->con nIndex, work->connIndex); | 670 ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h | :63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | run Ring(tiid, nthreads, workd(tid), nthreads(n); t | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432h:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2r: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.he:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, fl 432 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ if 671(tid < sub | tn) RunW orkCol l()m.run(tid, seubtmn.,comm.b uwork);f fSi | z ^e s[/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cppNC:C7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINoat, NCCEL_ALGO__RING, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadNCCL_PROTOn_In file included from cclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, nccSIMPLE,l 2) | ^ F/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ L/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h_PROTO_SIMPLE]/N:CCL_STE670PS/sizeo:f(T) : 15stepSize_): { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:note: 7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here field 'nthreads' will be initialized after field 'tidInBlock' 33 | pr ims(ti d670 | tid(tis), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthre, ntahreads,d &ring->psrev, &), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(ntuncReduceh, FuncMinMarx, float,e NCCL_ALaGO_RING, NCdCL_PRsOTO_S)IMPLE, 2,) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.htidInBlo:611:62c: note: expanded from macro 'DEFINE_ncclDevFunc'k (t hreadIdx611 | R.ux), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_nWork Batch | stepSiz, aelgo, pr(otostepSize, unr_oll>( )== 0 ? nc.run(c); \ | ^l nthre/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSads(hnthread:ms), tidIn670eBlock(t:hremad15I.dx.x), g:roup(grou p),note: | ^~~~~~~~~~~ field 'nthreads' will be initialized after field 'tidInBlock'comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_2, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE _ncclDevFunc(Reduce_R670ING_SIMP | LE_MinMax _f32_4, ncc lFuncReduce, FuncMinMa x, float, tNCid(tid), nthreads(nthreads),CL_ALGO_R ING, NCCL_tPROTO_SIi/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670M:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]P 670 | Ltid(tdid)E, nthreI,ads(nn threads),4B tidInBlol)ck(threadIo dx c.x), grou| pk(group),^ ( | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ t671 | st/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hehpSizreadIdx.x), group:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, Lalgo, proEto, unroll>]().run()/; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:N670:15: note: Cfield 'nthreads' will be initialized after field 'tidInBlock' 670 | tiCd(tid), L]/NCCL__nSTEPS/siSzteof(T) :Th steErpSizPe_e) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Sa | group(group /d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | psizeof(Ts()nthreadsr ims(tid, n:th stepSize_) { ), tidInBlock(trea ds, h&ring->| prrev ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~,e a&d| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereIdx. x), group(group)rin,g->nex t , work33 | prims(tid, | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | ->setndbuff, iwork-/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670d> | tid(tid), nthrerecvbuads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSi(ftif, dw), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork->redOpArg, 0, worze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group nthreakds, &r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ -ing->prev>, connIndex, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hnote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f32_4, ncclFuncReduce, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, manIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ tissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrie/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11r: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] _ 75 | b barrier_yby_group()_; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:g29:15:r note: expanded from macro 'barrier_by_group' 29 | o const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from ZE; \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grou| ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 35: 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, da/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ha:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:2, flag2; | ^~~~~ 80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271barrier_by_gro:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | + ll128Off set; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h::11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: 174/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h :warning: 75unused variable 'w' [-Wunused-variable]:7 : 80warning: unused variable 'w' [-Wunused-variable] | 75 | barr ier_babrrier_by_group()y_grou;p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hnote: expanded from macro 'barrier_by_group': 2929 | : const int wIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ = threadIdx.x/WARP_SIZE; \ | ^ 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReducIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_2, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f64_4, ncclFuncReduce, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1111 warnings generated when compiling for gfx1201. warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 1111 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp| : ^~~~~2 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h::11145: :In file included from 28/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:: 175warning: : unused variable 'data2' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h :271: 19145: | warning: unused variable 'ptr' [-Wunused-variable] uint32_t d a271t | a 1 , f l a g 1u,i ndta6t4a_2t,* fpltarg 2=; r e| c ^~~~~v Ptr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h(:0145):+35l:l 1warning: 2unused variable 'flag2' [-Wunused-variable]8 Offs e145t | ; | ^~~u int32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &riIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid),ng->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_In file included from RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(st| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | e stepSpize(stepSSize_ == i0 ? ncclSzhmem.comem.buffSize_s[NCCL_PROTO_SIMPL E]/NCCL_=STEPS/siz=eof(T) :0 ? ncclShmem.comm.buff stepSSize_) { i | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group z/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ringes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->rec->prvev, &rinbg->nextu, work-f>sendbuff,f work->r,ecvbuff , work->wredOpArgo, 0, work->rconnIndexk, work->-connInde>x); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRinIn file included from g(tid,15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thrredOpAerg, 0, waork->connIdndex, wIork->connIdndex); | x ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63.:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here x 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7), | group(gDroup), E | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ F | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | I stepNSize(stepESize__ == 0 ? ncclShmem.ncomm.buffcSizes[NCCLc_PROTO_SlIMPLE]/NDCCL_STEPeS/sizeovf(T) : stFepSizeu_n)c(Reduce_RI { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduNG_SIcMPLE_MinMeax_f8_2_, ncclRFuncReducIe, FuncMNinMax, rcGcl_float_8, NCCSL_ALGO_RIING, NCMCL_PROTOP_SIMPLEtiL,d, nthEr eads,_ &2ringM->pr)ev, i&rin gn->ne xt, wo| rk-M>sen^dbufaf, w orxk->r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h_ecvbfu8f_2, ncclFuncReduce, FuncMinM:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchredO,pArg, 0, work->tconnIy, redop, algo, proto,ndex, work->counnIndenx); | ^ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hol:63l>:().5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ run(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid):, nthre670ads(nthre:ads), 15tidInBloc:k(threa dIdx.x), note: group(grfield 'nthreads' will be initialized after field 'tidInBlock'oup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 670:60: note: field 'group' will be initialized after field 'stepSize' 670670 | | tid tid(tid), nthreads(nthreads), tidInBlock(threadI(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFIN670 | E_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work:670-:15: warning: >sendbuff, work->recvbuff, work->initializer order does not match the declaration order [-Wreorder-ctor]r 670edOpArg, 0, work->connIn | dex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, Fu tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.commBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ Proto, COLL_UNROLL>(t.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670o, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12::15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBat | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize(st/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]p 670 | S tid(tiid), nthrzeads(nthereads)_, tidInBl ock(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here th=read77 | runRinId=x.xg(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 0 ? grounccp(lSghrmoeump.c),om m.buffSizes[N| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671C | CL_PROstepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizeTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, wors[NCCkL_PROTO_S-IMPLE]/NC>CL_STEPS/ssizeof(T) :e stepSize_n) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ d | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | b prims(tuid, ntfhf, work->recreadvs, &ringb-uff, work>prev, -&ri>redOpAng->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid cosnnIndeux, work->bconnIndex)t; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hn:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here )63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWork/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, RunWorkCollsendbuff, work->n, T,recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DCollE, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ,F RedIOp,N AlgoE, Prot_o, COLL_UNROLnL>().run(ticd, clDevsubtnF,u nc(woReduce_RINGr_k); | ^S /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cppI:MPLE_MinMax_f8_12:41: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here ,12 | DEFINEn_cncccllFuncReduce,DevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncRedu FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, prce, FunocMinMax, trccl_float8, NCCoL_ALGO_,RING, NCC L_PROuTO_SnIMPLE, 4r) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:611:62: lnote: expanded from macro 'DEFINE_ncclDevFunc' l611 | >RunWorkBa(tch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),In file included from | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp tinote: d(tid):, field 'group' will be initialized after field 'stepSize'nth1reads(nthre ads), ti dInBlock(thre670adIdx.x), | group(g tid(tid), roup), n | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:threads(nthreads), tidIn670:60: note: Bfield 'group' will be initialized after field 'stepSize' 670l | tiod(tid)c, nkthr(ethreadIdx.x), grads(nthreads), tiodInBlocukp(th: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:r14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] eadIdx.x),77 | g uint32_rt y, heaod, manutissa; p | ^ (group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_2, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_f8_4, ncclFuncReduce, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrierIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ _t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uintIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_2, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u32_4, ncclFuncReduce, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uisubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nt64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_2, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ oup), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u64_4, ncclFuncReduce, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_2, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_MinMax_u8_4, ncclFuncReduce, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINEs_niczceloDfev(FTu)n c:( RsedtuecpeS_iRzIeN_G)_ S{I M P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E _ P| r group(groupe MulSum_bf16_2, ncclFuncReduce, Fun/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hcP:r33e:M7u:l Snote: umin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here, hip_bfloat16 ,33 | N C C L _ A LpGrOi_mRsI(NtGi,d ,N CnCtLh_rPeRaOdTsO,_ S&IrMiPnLgE-,> p2r)e v ,| ^ &ring->/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hn:e611x:t62:, note: wexpanded from macro 'DEFINE_ncclDevFunc' ork-> s611e | n d b u fRufn,W owrkoBraktc-h>O,p Aarlgg,o ,0 ,p rwootrok,- >ucnornonlIl>n(d)ex.,ru nw(o)r;k -\> c o| n ^n Index); | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'63 :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 670 | 63 | t i d (rtuindR)i,n gn((tthidr,e andtIhdrxe.axd)s,, gwroorukp)(;g r o| u ^p ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h670::43260::78 :note: field 'group' will be initialized after field 'stepSize'note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 670 | 432 | t i d( t iidf) ,(t indt h().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, Func/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduce, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cppIn file included from :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthrea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thrdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduce, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145In file included from :14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145In file included from | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIn file included from IZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduce, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_A/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from LGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_bIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduce, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hexpanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFuIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto,nc(Reduce_RING_LL128_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ti:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); 11 warnings generated when compiling for host. | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduce, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 1111 warnings generated when compiling for gfx942. warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t dataIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ E, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduce, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | ba:r2r: iIn file included from er/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h_:b11y: _In file included from g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hr:o175u: p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h():;80 : 5| : ^~~~~~~~~~~~~~~~~~ warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 80 | 29 | bcaornrsite ri_nbty _wg r=o utph(r)e;a d I| d ^~~~~~~~~~~~~~~~~~x .x/WARP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h_:S29I:Z15E:; note: \expanded from macro 'barrier_by_group' | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrieIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nth_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduce, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :80:5: warning: unused variable 'w' [-Wunused-variable] In file included from 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTOIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthre_adLsL)1,2 8t,i d2I)n B lo| c^ k(threadIdx.x), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hg:r611o:u62p:( gnote: rexpanded from macro 'DEFINE_ncclDevFunc'o up), | ^~~~~~~~~~~~~~~~~611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670 :R60u:n Wnote: ofield 'group' will be initialized after field 'stepSize'r kBatch <670 | c o l l ,t tiyd,( tride)do,p ne,a dasl(gnot,h rperaodst)o, unroll>().run(); \ | ^ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduce, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :174In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | : c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.ho:n145s:t14 :i nwarning: t unused variable 'data1' [-Wunused-variable]w = threadIdx.x/WA RP145_ | S I Z E ;u i\n t3| 2 ^_ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cppi:n2t: 32In file included from _/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.ht: 11d: aIn file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ha:1175,: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hfl:a271g:119,: dwarning: atunused variable 'ptr' [-Wunused-variable]a 2, flag2; | ^~~~~ 271 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h : 145 :u21i:n twarning: unused variable 'flag1' [-Wunused-variable]6 4_t* p145t | r = ruint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ecvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_2, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf16_4, ncclFuncReduce, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | c onst i nt w = thre adIdx.x/WAR P_SIZ barE; \r i e| r_by_group(); | ^~~~~~~~~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_In file included from group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: In file included from :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h29::17415: :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h :note: 75expanded from macro 'barrier_by_group' :7: 29warning: unused variable 'w' [-Wunused-variable] 75 | barrier_b | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,:2 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hf:145:14: warning: unused variable 'data1' [-Wunused-variable] l145 | a uint32_tg data1, flag21, data2, f;lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | u| int3 ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 2_t data1, fl145 | uagi1, datnta32_t2 data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const iIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:warning: 15unused variable 'w' [-Wunused-variable] 80: | ba rrier_by_note: gexpanded from macro 'barrier_by_group' 29 | roup(); c| onst int w = threadIdx.nt w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | ui nt64_t* uptr = recvPtir(0)+ll128Onffts6e4_t* ptr =t; | ^~~ recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NC:670:15:C warning: initializer order does not match the declaration order [-Wreorder-ctor] L670 | tid(_tid), nthrPeads(nthreadsR), tidInBloOck(thTrO_SIMeaPdIdxL.E]x)/,NCCL_STE Pgroup(grS/sizeof(T) ou:p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | s tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_t e pSize_) { 671 | s| tepS ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hize(stepSiz:e33_:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here == 0 ? n 33 | prims(tcclShmem.comm.buffSizes[NCCL_PROIn file included from T/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^O_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Redid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn)uce_R ING_SIMPLE_RProd_bf8_u4, ncclFuncnReduce,W FuncProd,o rccl_bfloatr8, NCCL_ALGkO_RING, NCCCL_PROTO_oSIMPLE, 4) l| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hl().ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_2, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, wo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), trk->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBln, T, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuidInock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~ ncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_bf8_4, ncclFuncReduce, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint y, head, mantissa; | ^ 32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp 145 | ui:nt32_t da2ta1, flag1: , data2, flIn file included from ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h 145 | :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7::145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uinwarning: t32_t dataunused variable 'w' [-Wunused-variable]1, flag1 , data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp75:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from :80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ * ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hrkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreathreadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~670 | tid(t | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | id), nthreadstepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->cCOLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ onnIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_2, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f16_4, ncclFuncReduce, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 1111 warnings generated when compiling for gfx908. warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2 note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, dataIn file included from 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: 145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSi670 | tzid(tid)e, nthreadss(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_[NCPCL_PROTO_RSIMPLE]/NCOCL_STEPS/sTizeof(T)O : stepSi_ze_) { S| IMPLE]/NC ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | C group(group L_STEPS/siz/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.he:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereof(T) : stepSize_) 33 | prims(t { | id, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | w primso(tid, nrthreads, &kring->prev,- &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->c>connIondex, wornk->connInndex); | ^I /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5n: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested hered ex, 63 | w ork->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here r unRing(gtid, , 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().ru Pnroto(, COtLL_UiNROLdL>(),.run (tids, suubtnb, wotrk); n | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp,:7: 1: note: win instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here o7 | rDEFIkNE_);nc | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProclDedvFun,c(Red uce_fRING_SIMPlLE_Prood_fa32_2t, nc,clFu ncReNdCCL_ALGOuc_e, FRuncPIrNG, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:od, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:B11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: lwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | o tid(tid), cnthreads(nkthreads), (tidInBlock(tthreadIdxh.x), groupr(group), e| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671a | stepSidIdze(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeox.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) f(T) {: stepSi ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(| tid, nth group(groupreads, &r i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here id, ng->prev, n&ring->ntext, worhk->sendbuffr, eads, &ring->pwork->recvbuff, work->redOpArg, 0, work->connIn 432 | if (tid < subtn) RunWorkColl().run(tiddex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5:, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here rev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | r 63 | u runnRing(ti(tid, nthreadsk), work); | ^; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | : 432 if (:tid <78 subt:n) Ru nWorknote: Coll, 1, 2, 4>::run' requested hereFn, T , 432 RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, n7c | DEFIcNE_nlFuncReduce, FucclnDevFcunc(PReducre_RIoNG_SdIMPL,E_Pr od_ff32_2l, nccolFunacRedtuce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); , NC\CL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | | ^ tid(tid), nthread /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hs:670:(15:n note: field 'nthreads' will be initialized after field 'tidInBlock' t 670h | r teid(taid)d, snth)rea,ds( ntthrieadds),I tindInBBlolck(othrceadkIdx.(x),t grhoupr(gerouap),d | I ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hd:670:x60: note: .field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->con, tidInnBlock(threIadIdx.x), ngroup(gdroup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671x | stepS,ize(stepSiz e_ == 0 ? wncclShmork->connIndex);em.co mm.buf fSiz| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | pT, RedOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prAev, &rling->negxt, owork->,sendbu ffProto, COLL_UN, Rwork->OrecvLL>().run(tid, subuffb, wortk->rendOpAr,g, 0, work-w>connoIrknde); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINx, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tic(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ coll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_2, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f32_4, ncclFuncReduce, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const inIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll12In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:8Offset; | ^~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Redu,U tidInBNlock(tRhreadIOdx.x),L groupL(group>), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ )671 | .stepSirze(stepuSize_ ==n 0 ? n(cclShmtem.comim.bud, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here ce_RING_SIMPLE_Prod_f6 ffSizes[N47_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zeof(T) | DEFINE_ncclDevFunc(Reduce_RING_SIMPL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclE_Prod_f64_2, ncclFuncReduce, FuncSPhmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE,rod, double, NCCL_ALGO_ : stRepSize_) { I | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33G:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here , 33 | NprimCs(tid,C nthreLads, _&ring-P>prev,R &ringO->nexTt, workO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWor->sendbuffk, workB->recvbauff, twork->credOpAhrg, 0, , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkc-oll, ty, redop, algo, proto>connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: , unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | g runrRingr(tid, nothreadus, worpk); | ) ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:,432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | | if (ti ^~~~~~~~~~~~~~~~~d < su btn) R/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), unWotrkColl()c.ruk(threadIdx.x), gron(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: up(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_2, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f64_4, ncclFuncReduce, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from :175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barIn file included from r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cppi:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:r80:5: warning: unused variable 'w' [-Wunused-variable] _80 | bbarry_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | i er_by_gr oup(); | c ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: onote: expanded from macro 'barrier_by_group' 29 | nst c onsint w t= int w t =hreadIdx.x/W thrAeadIdx.x/RPW_ARP_SIZE; \ | ^ SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | ui271nt | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ata1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccs, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); LL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.htep:Size_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tidIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | LE]/ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~NCCL_S TEPS/s izeof| (T) : st tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_epSize_ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7671: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | | stepSize(stepSize_ == 0 ? ncclSh prims(mtid, nthereads, &mring->pr.ev, &rinc, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ gomm.buffSizes[NCCL_PROTO_SIMPLE]->next, /work->seNndbuff, Cwork->reCcvbuff, Lwor_Sk->redOpArg, 0, work->coTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33s, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here In file included from 432/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:2 | : In file included from | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11 : In file included from if/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h (tid < subtn) Run:670:15W: warning: initializer order does not match the declaration order [-Wreorder-ctor] o670 | tird(tkColild), nt, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFucnk(thrceadIdxR.x), egroudp(grouup), c | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ e| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ ,671 | stepSFize(stuepSizne_ ==c 0 ? PncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tidrod, rccl_float8, NCCL_ALGO_RI < subtn) RunWorkColl().run(tid, suNG, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, btn,a wolrk);g | ^o /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp, p:7r:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_2, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bCCL_PROTO_SIMPLE]/NCCL_STEPS/suffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_Sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->r 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ecvbuff,IMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_f8_4, ncclFuncReduce, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grouIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreap), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Re:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ duce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_2, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u32_4, ncclFuncReduce, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 1111 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ i 75n | t b arriwer_b y_gr=oup( ); | t ^~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :15: rnote: expanded from macro 'barrier_by_group'In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ eadIdx.x/WARP_ S29 | IZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11 ^~~~~: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:7: warning: unused variable 'w' [-Wunused-variable] :75 | 145 :barr28ier_:by_ gwarning: unused variable 'data2' [-Wunused-variable] 145 | urouip();n | t ^~~~~~~~~~~~~~~~~~ 3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h2_t data1, flag1, data:229:15,: note: expanded from macro 'barrier_by_group' 29 | f l coanst gint w = thrIn file included from eadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barri2e; | r ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h_by_group(); | ^~~~~~~~~~~~~~~~~~:145:35 : warning: unused variable 'flag2' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h 145 | : u29int:32_t15 dat:a 1,note: flagexpanded from macro 'barrier_by_group'1, d ata2 , In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | flagc2; o| ^~~~~ nst int w = threIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ adIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cppunused variable 'w' [-Wunused-variable]:2 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from 80/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: | 175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h: 271: 19 : bwarning: unused variable 'ptr' [-Wunused-variable]a r271r | i e r_ b uyin_t6g4_rt* optur p= (r); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ note: expanded from macro 'barrier_by_group' 29 | const int w = tehcvPtr(0r)+ll128Oeffset; | ^~~ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = rIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ecvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncc| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ l671 | stepSSize(stepSizhe_ == 0 ? nmcclShmem.coemm.buffSimzes[NCCL_.PROTO_SIMPcLE]/NCCL_SToEPSmm.buffS/isizeof(T)zes[NCCL_P :R stepSizeOTO_S_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here IMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &rin 33 | g -prims(>tidnext, work->sendbuff, nth,reads, &ring->prev,w &ringo->nextr, work->skendb->recvbuff, work->redOuffp, work-A>recvburff, wogrk-, 0, work->>recdOpArgo, 0, wnornIndk->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRingconnIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl(tidO, nthrpeads, ,work); | ^ A/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: lnote: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432g | o if (ti,d < su btn) RuPnWorkCrollL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd,R edOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_Puint64_t, NCCL_ALGO_ReI_RING_SNIMPLE_GProd_u64,_2, n cclFuNncReducCCL_PROTO_SIMe,P FunLcProdE, uin,t64_ t, N4CCL_) | ^ AL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:GO_62RING:, NCCL_PROTO_SRIOTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' note: 611 | expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run() ; Ru nWor\kB atch , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthr| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:a15: note: dfield 'nthreads' will be initialized after field 'tidInBlock' s670 | ( tind(titd)h, nthrreadesads), tidI(nntBhrleaodsc),k t(idtInhBlreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670o | c k(t hr ea tdIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), ntidh(tride),a ndthsre(adsn(ntthhrreades)a, dtisdI)n,Bl octk(ithrdInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hthreads(nthreads), :t508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, woidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | rk->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ InBlock(t(h).run(); r\ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:15:a note: field 'nthreads' will be initialized after field 'tidInBlock' 670d | Itdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSiid(tzid), ntehreads((nthreads)s, tidIntBlock(tehreadIdpx.x), gSroup(grizeoup),_ == 0 ? | ^~~~~~~~~~~~~~~~~n /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670cclS| h ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m:60: note: field 'group' will be initialized after field 'stepSize' e m.c670 | omm.buffSize s tid(ti[d), nthNCCL_PROTO_SIMPLE]rea/ds(nthrNeads), tCidInBloCLck(threadIdx.x), group(group), | ^~~~~~~~~~~_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads , &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), t15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tigroup), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_2, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u64_4, ncclFuncReduce, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h: 14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from Idx.x/WA/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2R: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:P173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: _unused variable 'w' [-Wunused-variable] 75 | S barrIier_by_grZoup(); | E ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: ;expanded from macro 'barrier_by_group' \ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp| :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11 ^~~~~~~~~~~~~~~~~~: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: 75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hier_by_group:(); | ^~~~~~~~~~~~~~~~~~29 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29::15: note: expanded from macro 'barrier_by_group' 2915 | con:st int w = thrnote: eadIdx.expanded from macro 'barrier_by_group'x/WARP_SI ZE; \ | ^ 29 | const int w = threadIIn file included from dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145145:14: warning: unused variable 'data1' [-Wunused-variable] | 145 | u int32_t da ta1, fl ag1, dat a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h145:35: warning: unused variable 'flag2' [-Wunused-variable] 145: | 145 uint:32_t 35data1:, fla g1, dwarning: ata2,unused variable 'flag2' [-Wunused-variable] flag 2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barri/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11er: _by_In file included from group(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h); | : ^~~~~~~~~~~~~~~~~~ 175/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:: note: expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h 29 | : con80st:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flIn file included from ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: warning: unused variable 'flag2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->c670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ onnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->senIn file included from dbuff, work->recvbuff, w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ork->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4 group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid),In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nt nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlIn file included from ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->neIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ xt, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, wo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nrk->cotnnIndex, whork->cIn file included from reaonnIndedx); | s ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here )63 | runRi,ng(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested hereUNfRO LL>S( ).irun(tzid33, seu | btns, wo[r k);N | ^ C /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cppC:12:1: Lnote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here _12 | D PEFINEpR_nccrOlDeviTFumOnc(sRedu_(ce_RStING_SIiIMPLMdE_PrP,od_uL 8_4,En ncclt]Fun/hcRedurNce, eFCuncaPCrodd,L uisn_t8_,tS, NCTCL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] &AErPS/sizeof( 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCT) : stLGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Runing->prev, &ring->next, work->sendbuff, work->recepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_2, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vbuff, work->redOpArg, 0, work->connIndex, work->WorckBaotch, )algo;, p ro to,| un ^rol l>()/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h.run();: \63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432 | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670oll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads):78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevF, tuidInnBlcock((thRreaedIddx.xu), gcrouep(g_rouRp),I | N ^~~~~~~~~~~ G_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Prod_u8_4, ncclFuncReduce, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11x: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/:145:14: warning: Wunused variable 'data1' [-Wunused-variable] 145 | A uintR32_t datPa1, fla_g1, dataS2, flagI2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hZ:145:21: Ewarning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: ; \ | ^ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:757: warning: unused variable 'w' [-Wunused-variable] | 75 | barrier_ b y_gr oup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hbarrier_by_group(); | ^~~~~~~~~~~~~~~~~~ :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h29:15: note: expanded from macro 'barrier_by_group' 29 | : con29st int w: = thre15adIdx.x/:WARP_SIZ E; \ | note: ^ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hf:145:21: lwarning: unused variable 'flag1' [-Wunused-variable] 145 | a uintg32_t da2ta1, fl;ag1, da ta2, fl ag2; | | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145: ^~~~~28: warning: unused variable 'data2' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.huint32_t data1:, flag1, data2, flag1452; | ^~~~~ :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: 28warning: unused variable 'flag2' [-Wunused-variable] : warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dat145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier80 | _ barrbier_byy_gro_up();g | ^~~~~~~~~~~~~~~~~~ r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29o:15: note: expanded from macro 'barrier_by_group' 29u | cponst (int w) = ;threa dIdx .x/W| ARP_S ^~~~~~~~~~~~~~~~~~IZE; \ | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 11 warnings generated when compiling for gfx906. 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; In file included from \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:ta1, flag1, data219: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ , flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWork12 warnings generated when compiling for gfx90a. Coll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group (stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSi[ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work/usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp ->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : sIn file included from tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cppthr:ead2s, : &riIn file included from ng-/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h>pr:ev11, &: rinIn file included from g->/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hnext, :173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:work->sendbuff, work->recvbuff, wo670r:15:k warning: initializer order does not match the declaration order [-Wreorder-ctor]- >670 | r e tidd(tOid)p, nAthrreadgs(n,thr ea0d,s) work->connIndex, work->con,n ItidInnBldocek(xt)hr;ea dI dx| .x ^), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hroup:(g65ro:up5),: | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | 65 | ru stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[nRNiCnCgC(tCidL, nthr_eaSdsT, EwPoSrk/);s i | ^z /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.he:o432f:(78:T note: )in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here :432 | stepS if (tid < subtn) RunWoizre_k) C{ o | l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ l | < group(group Fn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h,: 34:T7:, note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here R e34 | d O p p, Algo, Proto, COLrims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid11, subtn, work); warnings generated | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp when compiling for host:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7. | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_2, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf16_4, ncclFuncReduceScatter, FuncMinMax, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ E; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP* ptr = recvPtr(0)+ll128Offset; | ^~~ _SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:e_ == 02 ? ncclS: hmem.comIn file included from m./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hbu:ffSizes[N670CC:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | L_PROTO_SIMPLE]/NC CL_STEPS/tsizeof(T)i : stepSidze_) {( | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7:i note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | d prims), nthreads(nt(tidh, nthreadrs, eads), tidInBl&riong->prev,c &rk(threadIdx.x)ing->nex,t, work->sendbuff, work->recvb group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeofuff,( work->TredOpArg), 0, wo rk->con:nIndex stepSize_) {, work ->connI ndex); | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | run| Ring(tid, nthr:34:7eads:, note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here work);34 | prims(t| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hi:432:78: note: din instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here , nthreads, &ring->prev, 432 | if (tid < subtn) RunWorkCollnext, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->coTn, RednOp, AIlgo, Prnoto, dCOLL_eUNROLxL>().)run(t;id, s ubtn, work| ); | ^ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp :12:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | D:EFINE65_nc:5cl: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | nccl ^Func Reduc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.heScatter, Fun:432:78:cMinMax, rccl_bfloat8, NCCL note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl:611:(62: note: )expanded from macro 'DEFINE_ncclDevFunc' .611 | r RunuWorkBnatch( ^, a /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFlugo, nprotco, (unroRll>(e).rudn();u \ | c ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:15S: note: field 'nthreads' will be initialized after field 'tidInBlock' c atter_RING_SIMPLE_MinMax_bf8_2670 | , ncclFuncRe tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)duc,eS catter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_grRoupI(grNoGup),, | ^~~~~~~~~~~~~~~~~ N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:C670:60C: note: Lfield 'group' will be initialized after field 'stepSize' _670 | P R tiOdT(tiOd),_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch nt , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | i:670:15: warning: finitializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: tid(tid(), nthrteads(nthireads), tdidInBloc k(threadI().run(tid, su.bbuffSitzes[NCCL_nPROTO_SI,MPL work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cppE]/N670C:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | :12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWor | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_)kColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_2, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_bf8_4, ncclFuncReduceScatter, FuncMinMax, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WAIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] R P_SIZE; \ | 271 ^ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | con(0)+ll128Offset; | ^~~ st int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = In file included from threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: const inwarning: tunused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlizes[NCCL_ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreadsIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid)), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run()In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_2, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hRING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMinMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMinMax<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f16_4, ncclFuncReduceScatter, FuncMinMax, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] : warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ringIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ->nex t, workt->sendbiuff, wodrk->rec(vbuff, twork->iredOpd), nthreads(nthreads), tidInBlock(thArrg, 0,e work->caonnIndedx, workI->connIdndex);x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65s:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here t 65 | reunRing(tid, nthreads, work)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(t.biuffSidzes[N)CCL_P,ROTO_S IMPLEn]/NCCtL_STEhPS/srizeofe(T) :a stepdSize_s) { ( | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupn /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ :34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads,; | ^& /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432r:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herei 432 | n gif (t-id < >subtnp) RunrWorkCeoll(n).rung(tid,- subtn>, wornk); e| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cppx:7:1t: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here , 7 work->sendbuff, work->recvbuff, wo | DEFINE_ncclDevFunc(ReduceScatter_RING_SIrk->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tMiPLE_MdinMa, nx_ft32_2,h nccrlFeuncRaeducdeScastter,, Fun cMinwMax,o florat, NCCL_AkLGO_)RING,; NC CL_PR OTO_| SI ^MPL E, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: :expanded from macro 'DEFINE_ncclDevFunc' 432611 | : 78 Ru:nWor kBanote: tin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (ch,> a(lgo), .prortou, nun(rotll>i().rdun(),; \ | s ^ u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hb:670t:15n: ,note: field 'nthreads' will be initialized after field 'tidInBlock' w670 | o r tkid()tid;), nt h| reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_2, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^Size_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f32_4, ncclFuncReduceScatter, FuncMinMax, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group()/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint64_t* pIn file included from tr =/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2 : In file included from recvPtr(0)+ll128/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:O7: warning: unused variable 'w' [-Wunused-variable]ffset; 75 | | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:1742: : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h7:11: In file included from : warning: unused variable 'w' [-Wunused-variable] 75 | ba/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145r:14: warning: unused variable 'data1' [-Wunused-variable] r145 | ier_b yuint32_t data1, flag1, data2, flag2; | ^~~~~ _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29::15: note: 2expanded from macro 'barrier_by_group' 29 | : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29 : const15 int w =: thread Idx.x/Wnote: ARP_SIZexpanded from macro 'barrier_by_group'E; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = re/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:cvPtr(02)+ll128: Offset;In file included from | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Red/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), 11 warnings generated when compiling for host. tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hwork->connIndex); :508| ^: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:6529:: 5: warning: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid r%unRingW(Stid, nIthZE), warpreads,( worktid/WARP_SIZE); )| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:,432:78: | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | wif (tiad rpI< nsuBlock(threbtn)a RundWoIdx.x/WARP_SIZErkC)oll=().run(=tid3),, subtn, work group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.); b | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:u1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here f12 | DEfFINE_nScclDeviFunczes[NCCL_PROTO_LL12(8Reduc]eScatte/r_RINGN_SIMPLCE_MinMaCx_L_STEPS/sif64_4z, ncclFeof(uint64_t)uncRe)duceSc atte{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r, Fu | group(groupncMinM /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hax,:34:7:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 double, | NCCL_ ALG O_RINGprims(, tNCCLid, nt_PRhOTO_reaSdIMPs, &ring->pLE,r 4) | e^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hv,:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connI 611n | RudnWorkBeatch, algo, pro432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to , unr| oll>() ^.run( ); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79\ | : ^ 5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:r15: note: ufield 'nthreads' will be initialized after field 'tidInBlock' n670 | tidR(tidi), nnthregads(s),( tidtInBliock(tdhrea,dIdx. x), ngrotup(ghrIn file included from reaoup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2ds, : worIn file included from k);/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h | : ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h11:432:: 78: In file included from note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h432 | : 173 if : (ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hd 670::<60: note: 670field 'group' will be initialized after field 'stepSize's :u670 | 15 b : tti dn(twarning: )id), nthrinitializer order does not match the declaration order [-Wreorder-ctor]Reau d ns(nWthre670oads | r), ktid CInB tid(tid), nthreads(nthreads), tidInBloll().drun(Itiloddck(x,thr. eaxsdId)ux.x,b) , grgouptr(groonup,u), p | ^~~~~~~~~~~ (wgroork); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceSup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | catte r_RI NG_L L128 _Min Max_f 64_2p, ncrclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_2, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f64_4, ncclFuncReduceScatter, FuncMinMax, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 12 warnings generated when compiling for gfx90a. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceSIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ catter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_2, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_f8_4, ncclFuncReduceScatter, FuncMinMax, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, protIn file included from o, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp7::12:: In file included from note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :7670 | :D15E:F Iwarning: Ninitializer order does not match the declaration order [-Wreorder-ctor]E_ ncclDevFunc(Reduc e670S | c a t t etr_iRdI(NtGi_dS)I,M PnLtEh_rMeiandMsa(xn_tuh3r2e_a2d,s )n,c ctliFduInncBRleoduccke(tShcraetatdeIrd,x .Fxu)n,c MgirnoMuapx(,g ruoiunpt)3,2 _ t| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_L _ALGO_RI N671G | , N C CsLt_ePpRSOiTzO_eS(IsMtPeLpES,i z2e)_ =| =^ 0 ? ncc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hl:S611h:m62e: mnote: .expanded from macro 'DEFINE_ncclDevFunc'co mm.bu ff611S | i z e sR[uNnCWCoLr_kPBRaOtTcOh_s,i zaelogof(,T )p r:o tsot,e puSnirzoel_l)> ({) . r| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n ( )| ; group(group\ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:: 34note: :7field 'nthreads' will be initialized after field 'tidInBlock': note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 670 | 34t | i d ( t i dp)r,i mnst(htrieda,d sn(tnhtrheraedasd,s )&,r itnigd-I>npBrleovc,k (&trhirnega-d>next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid (65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_2, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(thread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(Idx.x), group(group), | ^~~~~~~~~~~ ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u32_4, ncclFuncReduceScatter, FuncMinMax, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from 1, da/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:t11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:a174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h2:145:14: warning: unused variable 'data1' [-Wunused-variable] , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | ui145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | unt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:l2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:111: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:2175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19:8 warning: unused variable 'ptr' [-Wunused-variable] O271 | f uintfset; 64_t*| ^~~ ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl()(.run(tisd, subttn, weork)p; | S ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: inote: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here ze_ == 7 | 0DEFIN E_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFun? ncccRlShmeme.comm.bduffSizes[NCuCL_PROcTO_SIMePLE]S/NCCL_cSTaEPS/sitzeof(Tt) : steepSri, FuncMinMax, uint64_t, NCCL_ALGO_zeR_I) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group N/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7G: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here , 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sen NCdCL_PRObTO_SIuMPLE,f 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611f:62: note: ,expanded from macro 'DEFINE_ncclDevFunc' 611 | RwunWorkoBatchrk->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInB/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx..cxomm.buffS)izes[NCC,L_ PROTO_SIMgPLE]/NCCrL_STEPS/osizeof(T)u : stepSpize_)( { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ g| r group(groupoup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h : | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_34 :7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here =34 | = prims( tid, nth0re ? ncclShmem.comads, m&ring->p.rev, &bruinfgfSizes[NCCL_PR->Onext, woTrk->senOd_SIMPLE]/NCbuff, Cwork->Lr_eSTcvEPS/sizeofbuff, work->re(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring-dOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connInde432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINx, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRO11 warnings generated when compiling for gfx1101. TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h nthreads(nthreads), tidInBlock(threadIdx.:670:15: xwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | ) ,t id(tid), nthreads(nthreads),group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h tidInBlock(:threadI34dx.x), g:roup(gr7oup:), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ note: 671 | in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here stepSiz e(step Size_ == 0 ?34 nc | clShm em.comm.b uffS izes [NCCL_P ROTO_SIM PLE]/pNrims(tid, nthreCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRinLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColg(ti d, nthTreads,, wor k); R| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.he:432:d78: Onote: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here p432 | if (, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work);tid < subt| n ^) Run Work/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cppColl().run(tid, subtn, wo1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Mirnk);M a| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cppx:_12:u1:6 note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here4 _12 | D2EFI,N E_nnccclclFuDevFncuRnced(uRceeduScceaStter, FuncMinMax, uint64_t, NCcaCtteLr_R_INGA_SILMPLGE_MOinMax__u6R4_4I, nNccGlFu,ncR eduNceSCcatCteLr_PROT, FuncOM_SinMIMaxPLE, 2) | ^, uint6 4_t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h, NCC:L_A611LG:O62: note: _RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hexpanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670::15611::62: note: expanded from macro 'DEFINE_ncclDevFunc' note: field 'nthreads' will be initialized after field 'tidInBlock' 611 | RunWo670rkB | atc h< col l, t yt, riedodp,( talgio, dpro)to,, n untrolhl>reads(nthreads), tidInBlock(t().hrunr();e \ a | ^ d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:I670:15d: note: xfield 'nthreads' will be initialized after field 'tidInBlock' .670 | x ) ti,d(tid) , ngthrroeauds(pn(thgrroeaudp), | ^~~~~~~~~~~~~~~~~ s), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, x), group(gr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_2, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ _ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hR:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidIunWorkColl().run(tid, subtn, wo11 warnings generated when compiling for gfx1100. rk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_nccnBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u64_4, ncclFuncReduceScatter, FuncMinMax, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, he/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cppad, mantissa; | ^ :1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h ba:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from .x/WARP_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hS:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable]Z 75 | E by _;group(b); | ^~~~~~~~~~~~~~~~~~a /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h :29:r15: \note: expanded from macro 'barrier_by_group' r29 | const int iw = ethr| eadIdx ^.x/ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hWARP_SIZE; \ | : ^ 29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hon:st int 11w =: :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h thre:a75dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ Idx:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const int w = tIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ hreadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uinIn file included from t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from :271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp: uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thIn file included from r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | baIn file included from rrier_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cppby_group():; | ^~~~~~~~~~~~~~~~~~2 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | :5: warning: unused variable 'w' [-Wunused-variable] 80 | con st in t barriew = threr_adIdx.x/WARP_S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:Z11E: ; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrieby_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx1102. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cppst:2: eIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hp:11S: iIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hz:173: e_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670 :15:= warning: = initializer order does not match the declaration order [-Wreorder-ctor] 0 ? ncclShmem.comm670 | . btid(utidf), fntShreiadsz(ntehresads[), NtidCInBClocLk(t_hrePadIRdx.Ox),T grO_oSupIM(gPrLEou]p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncc/NClCL_STESPSh/simzeeof(mT) : ste.pSicze_)o m{ m| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hb:34:7u: note: fin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here f34 | S iz persims[(tiNCCd,L _PnthrReaOdTOs_S,I M&PLriE]n/gNC->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, CLw_SToEPSr/sizkeof-(T)> : csteopnSnize_) {I n| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ d| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | nidef x,( wotrk-i>cod < subtn) RunWonnIrnkdex);C | o ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65l:5:l note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here< 65F | n r,unR ingT(t id,A nlgtoh, Proto, reads, work); | ^C OL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hL_U:N432RO:LL78>:( note: ).ruin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested heren( t432i | d, suibf tn(t,i In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ wdo < rsk)u; b t| n ^ ) R/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cppu:nWo7rk:Co1ll:< Fn, T, RedOpnote: ,in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here A7 | DEFIlNE_gnocclD,ev FPruotnc(ReduceScatter_RING_SIMPLE_Moi, nCOMaLLx_U_NRuOL8L>_2(,). runn(cticld, sFubutnn, cwoRrke);d u| ^ c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cppeS:cat7:1:t note: er,in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here Func7 | MinMax, uint8_t, NCCL_ADELFIGNEO_n_ccRlDIevNFunGc(,ReduceScatter_RI NNCGCL__PSROTIOMPLE_MinMax_u8_2, ncclFunc_SRIMePLdE,u 2c) e | ^S /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hc:a611:t62:t note: eexpanded from macro 'DEFINE_ncclDevFunc' r 611, | F RuunnWocrkMBaintcMh8_,t a, NlgCo,C L_pAroLtGOo_, RIuNGnro, lNCl>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreadCsL_(PRnOTtO_hSIrMPeLEa, d2)s | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611):62,: note: expanded from macro 'DEFINE_ncclDevFunc't i 611d | I n RBulnWoorckBaktc(h, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(st/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hep:670:S15: iwarning: initializer order does not match the declaration order [-Wreorder-ctor] z e_ 670 | = =tid( tid0) ,? nntcclShmem.comm.buffhreaSds(inthzreaeds)s, t[idINnBlCockC(Lth_PrReaOTdIO_dxS.xI)M, PLgrEo]up/N(CgrCoLu_p)ST, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ EPS | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_/s iz 671 | stepSize(stepSize_ == 0 ? nccleofSh(Tm) e: stempSi.ze_co) {m | m ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ .b | group(groupu ffS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hi:34z:7:e snote: [in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here N34 | prims(tid, nCtCL_hPROrT/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.heads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0,O_ SIMwPLEo]/NrCCkL-_S>TEPcS/osizneofn(T)I : nstedpSieze_x) {, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | w group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.ho:34r:7:k note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here- >34 | c porimns(tnid,I ntnhredadse, &rxin)g->;prev, &ring->next, w | o ^:670r :15k:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h- warning: >:initializer order does not match the declaration order [-Wreorder-ctor]s65e: n5:db note: uin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here670ff , | 65 wo | r k ti- d(>rruentRcidiv)bngu,< fTnt,hf r,Re wedOoadprs(,kn-> PrrthoeretdaOo,pd sA)C,rgOL,L _ti0UdI, NROnwBLlooLrk->>(tid, nthreads, work); | ck(t ^hr eadIdx.x)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h, group(gro:up)432, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_: 78671 | :c on note: nI in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested herends ext ,e w432por | Sk i-> zcon en (In sde tx)ie; fp | ^ S (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:ti65:iz5:d enote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here_ < =65s= | u 0 rb u?tnR ninn)g(OpCC,tLi d_,AlPR gOntoTh, O_PSrerIaoMtPdLos,, EC woOr]kLL/N_)C;U CNRL O_| SL ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hLTE>:P432:78(S: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn)).r un(Rtidu, snuWobrtnk, wCorko); l | ^l /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp<:7/F:sin1ze,:of (note: T)in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here :T s, t epSR7ei | dDzEe_F)O I{p N ,| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ _A| group(groupnl /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hgco:34:,c7: l note: PDin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here re34ov | tF ou n prcims((tidR, nethrde, CaOudLcseLSc,a _Ut&NteriRrnOg_LL>R-ING_SIMPLE>p_rev, M&rinig()n-.rM>unan(txeid_x, utsu8,bt_ 2w,n, wonorrkck);c- l>s F| ue ^ nndcb/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cppRufe:12duf, c:1weScatterork,-> reFuc: nvnote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested herecb Mu12if | DnfEFM,INa E_xwnc,ocl rDeukvFi-un>rtnce8(Rd_etOp,duA cNrgCe,SCc L0,at te_AworLr_RGkING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tiO_RING, N->cConnCIndLex,_ wPoRrOk-TO_>cSIonnMPLE,Ind de)2,) x)| ;nthreads(nthreads), tidInBlock(thread^ I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611d:62:x note: expanded from macro 'DEFINE_ncclDevFunc'. x611 | ) ,Run WorgkBartch, algo, pro t| o, ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h :65u:5n: note: rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here o65 | l lrun>Rin(g)(\tid , n thr| eads, ^ wo rk)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h; | ^:gr 670ou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:p15),: :| ^~~~~~~~~~~note: 432 field 'nthreads' will be initialized after field 'tidInBlock': 78 : 670 | note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~432 | if (tid < subtn) RunWorkColl /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro(u).pr)un,(t id ,| s ^~~~~~~~~~~ub tn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes11 warnings generated when compiling for host. [NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_2, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_minmax_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_MinMax_u8_4, ncclFuncReduceScatter, FuncMinMax, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ : warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145In file included from | uint32_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclSh(nthremads), tiedInBlockm(threadI.dx.x), gcroup(grooup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671m | st.epSize(stbepSize_ =u= 0 ? ncfclShmefm.comm.buffSizes[NCCL_PRSOTO_SIMPLiE]/NCCL_SzTEPS/sizes[NCCL_PROTO_SIMPLEeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: ]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreanote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here d34 | s p,rims(t id, nt&hreads,r &ringi->prev,n &ringg->next-, work->>sendbpuff, worrk->reecvbuffv, work-,>redO pArg, &0, worrk-ing->next, work->sendbuff, work->recvbuff, work->r>coennIdndex, Oworkp->conAnIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hr:65:5:g , 0, work->connIndnote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRingconnIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tOid, nLthreLads, wo_rk); U| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hN:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here R 432 | O ifL (Lt>i(tid, nthrd < subten) adRus,n WorwkCorolkl)<;F n | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if , T,( RedOtp, Aligo, Prdoto, COLL_UNRO LL>(<) subtn) RunWorkColl, 1, 2, 2>::run' requested here 7 | DProto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScattEFINeE_ncclrDev_FunRc(ReIducNeScaGtter_RI_NG_SIMSPLE_PreMIulSumM_bf16_P2, nccLlFuncREeduceS_catterP, FuncrPreMuleSum, hMip_bflouat1l6, NCCSLum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hi_ALpGO_RING, NC_CL_PROTO_SIbMPLE, 2) f | ^ l/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Runat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested hereIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, w if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, ork->recvbuff, work->redOpArg, 0, work2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: connIndeIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWox, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:rkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T 670 | tid(tid), nthreads(nt) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_2, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connInd/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALof(GT) : stepSiOze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hR:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested hereI N34 | priGms(tid,, nth reads,N &rinCg->prCev, L&ring-_>nextP,ROT woO_SIrk->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5MPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidI:n note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here B65 | l runRing(tidr, nthereadsa, wordk); I| ^ d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: xnote: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here .432 | x ) , group(if (tid < subtn) RunWorkColl().run(tid, subtn, workgroup), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670RunWorkBatch, a:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] lgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | 670 | tid(tid), nthreads(nthread tis), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadI 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:dx.x), group(group), | ^~~~~~~~~~~ 62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf16_4, ncclFuncReduceScatter, FuncPreMulSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :15: note: expanded from macro 'barrier_by_group' 29 | c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ onst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 14535: warning: unused variable 'flag2' [-Wunused-variable] 145 | | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, uint3 2_t data1, flag1, data2, flag2; | ^~~~~ dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadata2,I flag2; d| ^~~~~ x.x/WARP_SIZEIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1,; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ onst int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint32_t data1, fl/WAaRP_SIZEg; \ | ^ In file included from 1, data2,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from uint32_t data1, flag1, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OfIn file included from fset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hI:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271d:19: warning: unused variable 'ptr' [-Wunused-variable] x 271 | . uxint64_/t* pWtr = rAecvPtrR(0)+ll1P28Offs_et; | S ^~~ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclDevFun/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h(:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:R15educeS: warning: initializer order does not match the declaration order [-Wreorder-ctor] c 670 | a tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEtter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here d34 | o primsp(ti, algo, proto, ->preuv, &rin/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hg:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ ->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h, :65:5t: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested herei 65d | IrunRningB(tird, nethreaads, dwIork)d; | x ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h.:432x), group(group):78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PRO11T warnings generated when compiling for host. O_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(RIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, neduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nT, RedOp, Algo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), ntthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidIn file included from InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:t173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:i670:15:d warning: (initializer order does not match the declaration order [-Wreorder-ctor] t670 | i tidd(ti)d), nthreads,(nth readns), ttidIhnBlorck(tehreadaIdx.dx), sgr(nthreads), tidInoup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | Block(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ epSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : ste:670:15p: warning: initializer order does not match the declaration order [-Wreorder-ctor] S670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMize_)P { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ L| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34E:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here ]34 | / primsN(tid, ntChreads, C&ring->pLrev,_STEPS/sizeof(T) : step &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65Si | ze_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h :34: 7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here r 34 | u pnrims(tiRd, nthireads, &ring->prev, &ring->nex/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ht, work->sendbuff, work-:670:15>: warning: initializer order does not match the declaration order [-Wreorder-ctor] r670 | ecvbuff, wor tidk(tid), n-t>redhreOadngidILcn>(oBtidnl, nnothrIceadnks, d(woretk);xh | ,r ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h e:wa432:78od:rI note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested herekd -x432 | .> xc i)fo (,tnid nIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65gr:oup(5grou:p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here < subt671 n) | Run65 Wo | rkC oll (unRi)ng12(tid:, nt1hrea:ds, worknote: ); in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :432 :78cl:Shme m12note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here | . DcoEm432m.b | uff Siz es[ NCC L_P ROT Oi_SIfMPL E]/(NCCtL_SiTEPdS/s izeo, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested hereg o34G_, | SI MPP Lr Eo _Pt reo Mu,plS rumCi_ObfmL8_sL4, _(ncUtcliNROLL>().run(tidd,, nt hresadsu, &brintg->npre,Fuv nc,wRe od&urcrekSic)ant;tge r-, >F| unn ^ceP rxe/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpptMul,Su: m,12w r:occ1rl_:kbf -lonote: >atin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here8 s ,12 | DEFendIbufNf, E NC_wCL_noALrGO_RkIcc-NlD>Grec,v NbeCvuFCufnLcf(_R, worPROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.heduce:Sca611tte:r_R62ING:_SI MPnote: LE_Pexpanded from macro 'DEFINE_ncclDevFunc'reM ulS uk-m>611_re | bd fOp 8Ar _g, 4 0R,, u wonnrkWco->cronnkcInBldeaFx,tu wocnrkhc->, ProtoSimple<2, 2, 4>, 4>' requested here 65 | N C rCuLnRi_ngt, aolgGo,, p, ro NtoCC, OCunLLroL_ll_P>(UR).NOruTRn(OO);L_ \LS I> (M| ^tP /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' iL Ed,,670 | n4 )t h r | et^ai dd/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hs,( ti:wd611o):r,62k :)n ;tnote: expanded from macro 'DEFINE_ncclDevFunc'h | r ^ea 611/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h | d s( :n 432t :hR78ru:en aWnote: doin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested heresr )k B, a432tt | ic dh I< nc Bo ll olic,fk (t(tyth,ir der ade) , gR ruaonlWuop(rggkorC,oo ulpplro<)tF,on ,, | u ^~~~~~~~~~~~~~~~~T n,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hro R:le670ld:>O60(p:), .note: r field 'group' will be initialized after field 'stepSize'uA nl (g670o, | ) ;P r \o tt oi| ,d ^ ( Ct/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hiOdLL):_,670: UNn15Rt: hOnote: rLfield 'nthreads' will be initialized after field 'tidInBlock'eL a> d(s)670(. | nr tu hn r( tetaiiddds, )(s,tu ibtdti)nd,,I n Bwnlotorhckrk)(ea; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmedt sh| r( ^neat dh/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpprIdx.ex:a)12d,:s 1)g:,r onote: tuin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereip d I(gnr12Bo | loup)cDkE, (FtI | hN ^~~~~~~~~~~rEm .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e_andIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | ticdcl(DtevFiundc()Re,duceS canttterh_RrIeNGa_SdIMsPLE(_PnretMuhrlSeuma_bdf8s_4), ,nc cltFuincdReIdunceBSclaottcerk, (FutnchPreMulSum, rccl_bfloraeatdIdx.x8),, NCCLgroup(group), | ^~~~~~~~~~~ _ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_bf8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t datIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from tid(tid),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:n11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:t173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670hreads(nthreads), tidInB:15:l warning: initializer order does not match the declaration order [-Wreorder-ctor] o 670c | ktid(t(id), ntthreadIhdreadsx(nt.x), group(grouphreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ ==), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->p 0 ?r nccleShmemv.com,m.buf fSize&s[NCCL_PROTOr_SIMPiLE]/NCnCL_STgEPS/sizeof(T)- : ste>pSizen_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recext, work->senvdbufbf, wourk->refcvbufff, wo,rk-> redOwpArg,o 0, rwork-k>conn-Index, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRingredO,pArg, 0, woCrk->OconnInLdex,L_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) In file included from | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_2, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | ste p65 | SrunRinig, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sLL>(ttid, nthreeads, wpork); | S ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cppize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->:12:1: nnote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12e | DEFINE_xncclDevtFunc(Red,uceSca tter_RwING_SIMoPLE_PrerMulSum_kf16_4, n-cclFunc>sRendbuff, work->recvbuff, workeduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch<->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ coll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f16_4, ncclFuncReduceScatter, FuncPreMulSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; In file included from | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | b 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const aint w = rthreadIdxr.x/WARier_P_SIby_group(); | ^~~~~~~~~~~~~~~~~~ ZE; \/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h | ^ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | bIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ;In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | ui/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11n: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.ht:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint332_t dat2a1, fla_g1, data2, flag2; | t ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145: 21: warning: data1, flag1, daunused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: ta2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uintunused variable 'data2' [-Wunused-variable] 1453 | ui2nt32_t d_ata1, tfla data1, flag1, data2, flagg2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h1:145:35: ,warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3d2_t dataa1ta2, flag2; | ^~~~~ , flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | 29 :15: note: expanded from macro 'barrier_by_group'c 29 | o In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); cnonsts in| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t w = thretadId int w = threax.x/WAdRP_SIZIE; \dx.x/WARP_SIZE; \ | ^ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'data1' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670::15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 173 : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | 670 | tid (tid), nt hreads(n thretid(tid), nthreads),a tidInBlock(ds(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.cnthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLEomm].buffSiz/es[NCCLN_PROTO_SCIMPLE]C/NCCL_STELPS/sizeo_STEPS/sizeof(Tf(T) ): step Size_: stepSiz) { e | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h): 34:7: note: {in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7 34 | prims:(tid, note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | pnthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0rims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRingpconn,Inde x, wPork-r>connoIndetx); o | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h,: 65:5:C note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here O65 | L rLunRi_ngLL>((tid,t nthireadsd, wo,rk) ; | n ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hthreads, wo:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINOLLE>()_.runn(tcidc, slubtnD, weorkv); F | ^ u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cppn:7:c1: note: (in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereR e7d | DEuFINcE_enccSlDevcFunac(tRedutceSecratt_er_RRINIG_SNIMPGLE_P_reMSulSIum_Mf32PLE_PreMulSum_f32_2, nc_2, ncclFuncReduceScatter, FuncPreclMFunucRelducSeScuam, flotat, NCCL_ALGO_RING, teNr, CFuncCPreMLulS_um,P flRoatO, NTCCLO_AL_GO_SRINIG,MPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hNCCL_P:R611OTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBa:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ arp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | , NC C L_PR OTO_L L128, t2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62:i note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run()d; \ | ^ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxnthreads(nthreads), t.ix), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmencclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiz tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), e_) {nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cppS:2: In file included from c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreadatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduce/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Scatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatsc(nthhreads<), tcidInoBloclk(thlread,Idx. x), tgrouyp(gr,oup) , | r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_e 671d | op, algo , st epSpize(rsteopSitze_o ==, 0 ? nucclnShmrem.ocomlm.bluff>Siz(es[)NCC.L_rPROTO_SuIMPnLE](/NC)CL_;STE PS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | pr\ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nitms(htid,r ntehreaadsd, &srin(g->npretv, h&rirnge->naextd, wsork)->s,end btuffi, wdorkI->rnecvbuff,B wlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ?:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here n7 | DEFIcNE_nccclDevFunlShc(RmeduceeScattmer_RING._SIMPLcE_PreMuolSum_fm32_2, ncclFuncReduceSmcatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatc.bhuffSizes[NCCL_PROTO_, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff,dop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_2, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f32_4, ncclFuncReduceScatter, FuncPreMulSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t77 | uyint32_,t y, heahd, manetissaa; | ^ d, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadI:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thredax.x/WdARP_ISIZEd; \ x| ^ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, fIn file included from lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 145 | 173 : uint/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h32_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :75:7: warning: unused variable 'w' [-Wunused-variable] 75 | t da ta1, fl ag1, data2, flagbarrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+llIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { :670:| 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid( group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreadtid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? s, &ring-n>prev, &cring->necxt, workl->sendbuSff, workh->recvbumff, worke->redOpAmrg, 0,. work->cconnIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatIndex,o work-mm.>bconuffSizes[NCCL_PRnIndOex); T| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hO:79:5:_SIMPLE]/NCCL_STEPS /note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79s | ruinRing(tid, nthreads, wor | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &rcih, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ng->next, work->sendbuff, work->reck)v; | ^ b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | u if f(tid redOpArg, 0, work->connIndex, work->connIndex); Algo , Pr| oto, ^ COLL_UNROLL >()/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h.run(tid,: sub65tn, :work5); :| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp :5note: :1: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested herenote: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | D EFINE_n65ccl | DevFu nc(R edu runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 2, nc432clFu | ncRe duce Scat ter, FuncPreMul Sum , doiublef, NC CL_A(LGO_tRINGi, NCdC < subtn) RunWorkColl().run(tid, subtn, work); L _PR| OTO_ ^LL12 8, 2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h::611:62:12 note: expanded from macro 'DEFINE_ncclDevFunc' : 611 | 1 R:unWo rkBanote: tch, 1, 2, 4>::run' requested herecol l, t y, redop<12ty>, | alDgo, EprotFo, uInroNll>(E).ru_n()n; \ c | ^ clDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvIn file included from buff, work->redOpArg, 0, work->conn/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cppInd:ex, w2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11ork: ->In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tconnIndiex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | rd), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bufunRing(teid, nthrseads, w[ork); N| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432C:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here C432 | L _if (tPid < suRbtn) RuOnWorkCToll().run(stid, stepSize_) { u btn, work); | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFIN:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->senE_ndcclbDevFunuc(RefduceSfcatter,_RING _SIMPwLE_ProeMulSrum_f6k4_2, -ncclF>uncRerduceSecattecr, Fuvncbuff, work->redOpArg, 0, worPkreMul-Sum, >doublce, NCoCL_ALGnO_RInNG, NIndex,CC L_PROwTO_ork->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hSIMP:LE,65 2) :| ^ 5/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611::62: note: expanded from macro 'DEFINE_ncclDevFunc' note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | 611 | runRing, , COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < algso, upbrotto, nunro)ll>().run( ); R\ u | n ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hW:670o:15:r note: field 'nthreads' will be initialized after field 'tidInBlock'k C670 | o tlid(ltid<), Fnthnrea,ds( nthTread,s ), tidInRBloeck(dOp, Algo, Proto, COLL_UNROLL>().thrreaudIdnx(.x)t, giroupd(g,rou p),s | u ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hb:670:t60: nnote: field 'group' will be initialized after field 'stepSize' , work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_n670c | c tlid(Dtide), vnthFreauds(nnthcreads), tidInBlock(threadIdx.(ReduceScatter_RING_SIMPLE_Prxe), Mgrouup(lgroSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_upS),I M| ^~~~~~~~~~~ PLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] (tid), 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15(: warning: initializer order does not match the declaration order [-Wreorder-ctor] ) 670. | rtid(tiud), nnthrea(ds(ntthreadis), tdidInB,loc subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCLMP_LE]/NPCCL_SRTEPS/OsizeoTf(T) O: ste_pSize_S) { I| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | M group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hP:34:7: Lnote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here E34, 4) | | ^ pri/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' ms(t id, nthr611 | RunWoeadsr, &rikng->pBraetvchnext, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->cedop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' onnInde670x); | | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h :65 :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | t irunRding<(T, RtediOp, Pdroto), ,COLL _UNRnOLL>t(tidh, ntrheads(ntreahds, rwoeads), tidInBlock(threadIdrk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads),In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work-> tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizec_onnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthr == 0 ? ncclShmem.comm.eads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_2, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, dLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ouble, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f64_4, ncclFuncReduceScatter, FuncPreMulSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:In file included from warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7 : warning: unused variable 'w' [-Wunused-variable] 75 | barrbier_by_graoup(); | r ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15rier_b:y note: expanded from macro 'barrier_by_group' 29_group(); | ^~~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h const int w = threadIdx.x/WAR:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ P_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:a14: warning: unused variable 'data1' [-Wunused-variable] t 145 | auint32_t 1data1, fl,ag1, dat a2, flag2;f | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hl:145:21:ag warning: 1unused variable 'flag1' [-Wunused-variable] , data2, 145 | uiflag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:nt32_t data1, 145fla:28: warning: unused variable 'data2' [-Wunused-variable] 145 | g1, data2, f lag2;uint32_t | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | c bonst int w = threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ .x/WARP_SIZE; \ | ^ arrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173In file included from : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | con/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from s/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: t int w = thrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: awarning: unused variable 'w' [-Wunused-variable] dI75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const in/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: t w =warning: threadunused variable 'data1' [-Wunused-variable]Idx.x/ WARP _145 | uiSIZE; \ | ^ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uinIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1In file included from , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ Idx.x//builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_2, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_f8_4, ncclFuncReduceScatter, FuncPreMulSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ead, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roup), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_2, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u32_4, ncclFuncReduceScatter, FuncPreMulSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from data2, flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp2:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:1511 warnings generated when compiling for host. : warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_2, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u64_4, ncclFuncReduceScatter, FuncPreMulSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WA/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_2, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_premulsum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PreMulSum_u8_4, ncclFuncReduceScatter, FuncPreMulSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const inIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h 671 | stepSiz:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthree(stepSiaze_ == 0d ? ncclSshmem.comm(.buffSinzes[NCCL_tPROTO_SIhreads), wid(tid%WARPMPLE_]/NCCLS_STEPS/Isizeof(ZT) : steEpSize_)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, warp(tid/W | A group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:R34:7P_S: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, IZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(groupIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSizework-(>sendbnuff, wcork->rcecvbuflf, workS->redOhpArg, 0m, work-e>connImndex, .work->cconnInodex); m | ^ m.buffSizes[NCCL_PROTO_LL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuf:432f:78: note: ,in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | w iof (trid RunWorrkeColl(,).ru n(tidw, suobtn,r workk); - | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp>:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_PconnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (rtod_ibf16_d2, n cclF, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | In file included from prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work-/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: >In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hr:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hecvbuff, work->re:670:15:dOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ : 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] | 670 | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuf tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthread stid(tid),) nthreads(,nthreads), tidInBlotck(thridInBloceadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProdROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_2, ncclFuk, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuhreads, &ring->prff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf16_4, ncclFuncReduceScatter, FuncProd, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group();In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ unused variable 'w' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_2, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_bf8_4, ncclFuncReduceScatter, FuncProd, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ up(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ , flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_2, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(n/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f16_4, ncclFuncReduceScatter, FuncProd, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp 12 warnings generated when compiling for gfx90a. 11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hstepS:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollIn file included from ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NC sCtepSizeL(stepSi_ze_ == 0P ? ncclRShmem.cOomm.buffTSizes[NOCCL_PROT_O_SIMPLSE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, IMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInworBk->senldbuff, owork->crecvbukff, wor(k->rtedOpArg, 0, workh->connrIndex,e work-a>connIdndex); I | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hd:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.xMPLE)]/NCC,L_S TEPS/sizeogf(Tr) : sotepSizue_) {p | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ( | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hgr:34:o7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here u 34 | p ) p, | ^~~~~~~~~~~ rims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Redu work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65ceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIIn file included from MPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRin/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:g2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_2, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: lgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthre 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f32_4, ncclFuncReduceScatter, FuncProd, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp::2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:warning: 75:7: warning: unused variable 'data2' [-Wunused-variable]unused variable 'w' [-Wunused-variable] 75 | barr ier_by_group()145; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h | :29:15: note: expanded from macro 'barrier_by_group' 29 | cons t in t uwint32_t =d threadIadxt.a1, flx/WARPa_gS1I,Z Ed; \ | ^ In file included from ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h ^~~~~:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75 :7: warning: unused variable 'w' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h 75 | : b145arri:e28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_tr_b y_dgroup(a); | ^~~~~~~~~~~~~~~~~~t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:a151: note: expanded from macro 'barrier_by_group', flag1, data2,29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tiO_LL1d28]/NCCL(_STEPS/sizteof(uint64i_t)) { d| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h,:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here nthreads(nthreads),34 | primst(tid, ntihreads, &ring->prev, &ring->next,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subt work->sendbudInBflock(thfread,Idx.x), g roup(growup), | o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ r| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | k stepSi-ze>recvbuff, work->redOpArg, 0, work->connIn(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL128, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn)In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:11 warnings generated when compiling for host. 1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWork/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | Coll().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_2, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f64_4, ncclFuncReduceScatter, FuncProd, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | 12 warnings generated when compiling for gfx90a. uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dat/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:a1, f15lag1, d:ata2, f lag2; note: | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hexpanded from macro 'barrier_by_group':145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t29 data1 | , flag1, d ata2, const int flag2; w | ^~~~~ = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: 75:7:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h warning: unused variable 'w' [-Wunused-variable] :75 | 80 barrie:r_by_gro5up(); : | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 29warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_:g15: note: expanded from macro 'barrier_by_group' r29 | const iont w = uthreapdIdx.x/W(ARP_SIZE; \ | ^ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hp:145:35:t warning: unused variable 'flag2' [-Wunused-variable] r145 | u int32_=t data 1, flagr1, datea2, flcag2; v| ^~~~~ Ptr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 11 warnings generated when compiling for host. 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>()./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, prot/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.ho:508:,29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | u tnid(tird), ntohreadll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15s(n:threa ds), note: wid(tfield 'nthreads' will be initialized after field 'tidInBlock'id%WAR P_SIZ E)670 | tid(tid), nthre, waarp(dtid/WsARP_S(IZE)n, | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gflragThoreadu((tipd%4))==3),, gr oup( grou| p), ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIM/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] PLE_Prod_f8_2, n c670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->clFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchredOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ty>, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof( | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_2, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid)o,p, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr nthreads(nthreads),oup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_f8_4, ncclFuncReduceScatter, FuncProd, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | baIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group'rrier_by_group 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | warning: unused variable 'w' [-Wunused-variable] 75 | barr ^~~~~i /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)PLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthrea 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)L_,ALGO_ RING,g NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prtid(tid), nthreads(nthreims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 a?lgo, proto,n unrolcl>().rucn(); \l | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hS:670:15h: note: field 'nthreads' will be initialized after field 'tidInBlock' m670 | e tid(mtid), .nthreadcs(nthroeads),m tidInmBlock(.threbuffSizes[NCCL_PROTO_SIMPLE]/NCCL_adISdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, grwoup(ogroupr), k| ^~~~~~~~~~~ ->redOpArg, 0, work->connIndex, work->connIndex); | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connInd/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(teix, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_2, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u32_4, ncclFuncReduceScatter, FuncProd, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h[ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o :14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ l, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, a 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_2, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.bu->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u64_4, ncclFuncReduceScatter, FuncProd, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, we_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:O15L:L >note: (field 'nthreads' will be initialized after field 'tidInBlock't id, nt h670r | e a d s ,t iwdo(rtki)d;) , | n ^t hreads(nthreads), t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hi:d432In:B78l:o cnote: kin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here( threadIdx .432x | ) , g r o uipf( g(rtoiudp )<, s u| b ^~~~~~~~~~~~~~~~~t n) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hR:u670n:W60o:r knote: Cfield 'group' will be initialized after field 'stepSize'o ll (t)i.drIunnB(ltoicdk,( tshurbetand,I dwxo.rxk)),; g r| o ^u p(group),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp : 7| : ^~~~~~~~~~~1 : note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grouork->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_2, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_prod_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Prod_u8_4, ncclFuncReduceScatter, FuncProd, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptIn file included from r = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthre173a: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | ds(nthreads), tidInBlock(threadIdx.x), group(gr RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat1work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subt6n, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthrea) RunWorkds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ Coll/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ x.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), L_STgEPS/sizeofr(T) : stepoSize_) { u| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hp:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here( 34 | g prims(tidr, nthreads,o &ring->pruev, &ring->pnext, work-)>sendbuff,, work->r | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ecvbu| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | f , work->resdtepSiOpArgz,e( 0,stepSize w_ork->con == 0 ? nccnIndex,l work->connIndex); | Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbu ^ f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: fin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | ru,nRing(tid, ntrhreads, wokrk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h-:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432> | if (trid < subten) RunWorkcCollf, work().run(tid, subtn, work)->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5; | : ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp: 7:1: note: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested hereDEFINE _ncclD evFunc(Redu65ceSca | runRing(tidduceS,catter nthreads,, Fu ncSum, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hhip_bfl:oat16,432 N:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tiCCLd_ALGO_R ING, N().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncccoll,l tyD, reedop,F alugo, pnrotoc, un(rollR>().reun()d; \ u| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hc:670:e15: note: Sfield 'nthreads' will be initialized after field 'tidInBlock' c670 | a tid(tid), tnthrteaer_RING_SIMPLE_Sum_bf16_2, dsn(nthcreadsc), tlidInFBlocuk(thnreadcIdx.Rx),edu gcroueScatter, Fup(grnoup),c | ^~~~~~~~~~~~~~~~~S /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:u670m, hip_bfloat16, NCCL_A:L60: note: Gfield 'group' will be initialized after field 'stepSize' O670 | _ tid(Rtid)I, nNthreads(nthreads), tidInBlG, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBoack(thtreadcIdx.hx), , algo, protoup(ogrou,p), | ^~~~~~~~~~~ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_2, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34::670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 07: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | ? prims( tid, nthnreads, &rcing->prev,c &ring->nlext, work->sendbuSff, work->recvbufhf, workm->redOpAerg, 0, womrk->conn.Index, cwork->oconnIndmex)m.buffSi; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hz:es[NCCL_PROTO65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here_ 65 | SIMPLE]/NCCL runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreaOp, dProto, CsOLL_UNRO,LL>(tid, nthrea&ds, work)r; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hi:432:78: nnote: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | g if (-tid < >subtnp) RunWorev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, rkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclwork->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | Dev Func( ReducreScattuer_nRING_RSIMPLiE_Snum_bf1g6_4, (tid, FuncSnum, hip_bfthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hloat16:, NCC432L_ALG:O_RIN78G, NCC: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | L_PR OTO_ SIMPLiE, 4)f (tid < sub | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' tn) RunWorkCol611l | , , COalgo,LL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFun cproto,( unroRll>()e.run(d); \ u | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hc:670:e15: note: field 'nthreads' will be initialized after field 'tidInBlock'S 670 | c tiad(tidt), nthtreadse(nthrreads)_, tidRInBlocING_SIMPLE_Suk(mthre_adIdxb.x), f16_4, ncclFuncRedgrouup(grocup), e| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hS:670:catter, FuncSum, hip_bfloat1606: note: field 'group' will be initialized after field 'stepSize', 670 | tidN(tidC), ntChreadsL(nt_ALGOhreads)_RING,, tid InNBCCL_PROTO_SIMPLE, 4) lock(threadIdx.x), gro | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run();up(grou p), | \ ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf16_4, ncclFuncReduceScatter, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y,In file included from head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(29:15: note: expanded from macro 'barrier_by_group' ) 29 | const int w = threadIdx.x/WARP;_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group()In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h29:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :15: note: expanded from macro 'barrier_by_group' 29 | const int w = data2, flag2; | ^~~~~ threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, fl/WARP_SIZE; \In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174| : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: ^warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cppIn file included from :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:8011:5: warning: unused variable 'w' [-Wunused-variable] 80: | baIn file included from rrier_by/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h_group(:); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:17429:15: note: expanded from macro 'barrier_by_group' : 29 | c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.honst int: w = th145readId:x.x/WARP14_SIZE; :\ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint3 2_t duata1,i flagn1, dtata2,3 flag22; | _ ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:t145:35: warning: unused variable 'flag2' [-Wunused-variable] 145d | auint3t2_t daata1, 1flag1,, dat a2, fflag2;l | ^~~~~ ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cons\ | t ^ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: In file included from unused variable 'ptr' [-Wunused-variable] 271 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint64_tIn file included from * ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:_SI2Z: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hE:; \ | 174 ^ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:35/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hwarning: :145:14unused variable 'flag2' [-Wunused-variable]: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t145 data | 1, fla g1, data2, fl ag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h uint32_t data1, flag:1451:21: warning: ,unused variable 'flag1' [-Wunused-variable] 145 | udint3a2_t data1, flag1, data2ta2, flag2; | ^~~~~ , flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIthreZadIdxE.x/WA;RP_S \ | ^ IZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid , FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclexpanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().runads(nthreads), tid(I); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeofSIMPLE]/NCCL_STEPS/sizeof(T) : s(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_2, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, wNG, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxork->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_bf8_4, ncclFuncReduceScatter, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/tWhreadIdx.x/WARP_SIZE; \ | ^ ARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flagIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ adIdx.x/WARP_SIZE; \ | ^ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | 75 uint | 32_t dat a1, flag1 , data2, flag 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | barri er_by_gruoup(); i | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hn:29:15: note: texpanded from macro 'barrier_by_group' 29 | 3 cons2t int w _= threatdIdx.x /WARP_SdIZE; \ a| ^ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:In file included from 2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:72: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_g barrrier_boy_groupu(); | p ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:(15: note: expanded from macro 'barrier_by_group' ) 29 | ; con st int w = th| readI ^~~~~~~~~~~~~~~~~~dx.x/W ARP_SI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hZ:29:15: note: expanded from macro 'barrier_by_group' E; \ 29 | ^ | const i: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ )+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from barr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cppi:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:e11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:_ warning: unused variable 'w' [-Wunused-variable] b75 | y barri_er_by_ggroupr(); | o ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14:: warning: 145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, da/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:t28: warning: unused variable 'data2' [-Wunused-variable] a 145 | 2 uint,32_t d ata1, fflag1,l data2a, flagg2; | ^~~~~ 2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:8035:; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : warning: unused variable 'flag2' [-Wunused-variable] warning: unused variable 'flag2' [-Wunused-variable] 145 | uint14532_t | d uint32_t data1,ata1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:=2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175t: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:h5: warning: runused variable 'w' [-Wunused-variable] 80 | e barrier_bya_grdoup(I); | d ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29x:15: note: .expanded from macro 'barrier_by_group' 29x | c/onst Wint w A= thrReadIPdx._x/WARPS_SIZEI; \ ZE; \ | ^ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, workroup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groutpid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, N670C | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173 algo, proto, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthr: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he:670:15: awarning: initializer order does not match the declaration order [-Wreorder-ctor] d670 | stid(t(id),n nthreatds(nthrheads),r tidIneBlock(athreaddIdxs),.x), tidInBglroup(ogck(threadIdx.roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIx)M, grouPp(grouLp), | E ^~~~~~~~~~~ ]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] runRing( tid, tnthrid(tid), nthreads(nthreads), tidInBealds, woork);c | ^ k/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:(78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().rutn(tid, hreasdIdux.x),b groutp(grounp), ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671w | ostepSirze(stkepSize)_ == ; | 0 ^ ? n cclSh/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cppmem.comm.buffSi:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested herezes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_2, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthre:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]a 670d | tisd(tid)), nthr,eads(n threadts), tiidInBlodck(thrIeadIdxn.x), gBroup(glroup),o | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | c tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | k st(epSizet(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepShreadIdx.x), group(group), | ^~~~~~~~~~~ ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSizes(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing11 warnings generated when compiling for host. (tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f16_4, ncclFuncReduceScatter, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp 11 warnings generated when compiling for gfx1200. 1111 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ ead, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_gIn file included from roup(/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h); :11: | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrIn file included from ier_by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2 | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barriIn file included from er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:In file included from note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: const iIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h::11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 15: note: expanded from macro 'barrier_by_group'nt w = th readIdx.x/W ARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from 29 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hconst in:t w = threadId174x.x/WARP_SIZE; \ : | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: In file included from warning: unused variable 'data1' [-Wunused-variable] 145 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: iIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hn:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2t32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hb:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: y/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5:_ warning: unused variable 'w' [-Wunused-variable] 80g | rbarriero_by_groupu(); | ^~~~~~~~~~~~~~~~~~p /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp15(::2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11): ; | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] note: expanded from macro 'barrier_by_group' 29 | const int75 | w = thrbea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpparrier_by:dIdx.2x/WA_RIn file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11group(P_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = threadIdx.x/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from //builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from W/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:A271:19: warning: RIn file included from unused variable 'ptr' [-Wunused-variable] 271P | _uint64_St* ptr =I rZE; \/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from ecvPtr (0)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h+l | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | l128Off set; | ^~~ uint32In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:u7: warning: unused variable 'w' [-Wunused-variable] i75 | 175n: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h: 271:19t: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ barrier_by_groupIn file included from (/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from In file included from ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] c 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cppou:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:n271:19n: warning: sunused variable 'ptr' [-Wunused-variable]t t int w = threadIdx.x/32_Wt dataA1, flag1, data2271 | , ui nt6f4_tR* ptP_SIZE; \ | ^ lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ r = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t*In file included from ptr = recvPt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:r(0)+ll128Of2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offsetset;; | | ^~~ ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271In file included from | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from Idx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(ti 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tiTEdPS/size(of(T) : sttepSizie_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here) 34 | , prims( nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runR | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),in g(tiod, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_2, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f32_4, ncclFuncReduceScatter, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barriwarning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.her_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1,In file included from data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, In file included from flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | warning: unused variable 'w' [-Wunused-variable] 75 | ba const int w = threadIdx.x/WARP_SIZE; \ | ^ rrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from barr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:i11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173e: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:r _bwarning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ y_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h | const int w = threadIdx.x/WARP_SIZE; \ | ^ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBloc/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hk(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint6ff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 4_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ =d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_P= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, woROrTk->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatteO_SIMPLE]/NCCL_r_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreadSTEPSs), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_2, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f64_4, ncclFuncReduceScatter, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp 1111 warnings generated when compiling for gfx1200. warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] g1, d271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ata2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | coIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flanst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, ntIn file included from hreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~:65:5 : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | r| unRing< tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_T, RedO p, Prot o, CO671 | stepSLiL_UNROLzL>(tid, enthread(s, works); | ^t /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:e78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here p 432 | S if (tidi < ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PRsOubtnTO_SIMPLE]/NCCL) Run_WorkCSollT().run(tid, subtn, work); | EPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | pr ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:i7:1: note: min instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DsEFINE_(ncclDtevFunci(Reducde, nthreads, &riScattner_RINgG_SIMP-LE_Su>prev, m_f&8_2, ring->next, work->sendbuff, work->recnccvlFuncRedbuceScatuter, FfuncSufm, rcc,l_flo work->redOpArg, 0, work->connIndex,at8 , wNCCLo_ALGrO_RIkNG, -NCCL>_PROcTO_SoIMPLnEIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | , 2n) | Index); | ^^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nll, tty, redhop, ealgo,a prodtos, un,roll >().wrun(o); rk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:\ 432| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h::670:7815: note: :field 'nthreads' will be initialized after field 'tidInBlock' note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subt n670 | ) ti d(tiRd), unthrneads(Wnthroeadsr), tkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runrnBlocok(thtreadIodx, COL.xL), g_roupU(groNup), R | ^~~~~~~~~~~~~~~~~ OLL>().run(tid, subtn, wo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hr:670:60k: note: field 'group' will be initialized after field 'stepSize') 670 | tid(tid), nthreads(ntRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidI; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScanBlotck(tthreaedIdxr.x),_ groRup(gING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSumr,oup) , | r ^~~~~~~~~~~ ccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h runRing(ti,d, nthre ads, wor k); | ^ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollhmem.com(m.buffSi)zes[NCCL.run(tid, _PROTsO_SIMPubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFuncLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->(RneduceeScatxter_tRING_S,IMPL E_Sum_wf8_4,o ncclrFuncRkeduce-Scatt>eser, FuncSum, rccl_float8, NCCL_ALGOndbuf_f, woRrk->rIecvbuNff, wGork->,redOp ArgNCCL, _0PROTO_SIMPL, woErk->c,o 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWnnInodex,r workk->Batch, ProtoSimple<2, 2, 4>, 4>' requested here y65 | >, algo, proto, unroll> runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | ().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grouif p(tid) < s,ubtn ) Ru nWor| kCol ^~~~~~~~~~~~~~~~~ld().sr)un,(ti d,t suibtnd, wIork); nBlock(threadIdx.x), group | ^( /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cppg:12:r1: onote: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here u 12p | DE)FI,N E_n ccl| Dev ^~~~~~~~~~~Fun c(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclrunRiShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ng(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_2, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_f8_4, ncclFuncReduceScatter, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | 145 | ^~~~~ uint 32_t da/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hta1, flag1, d:ata2, f145lag2; :| ^~~~~ 28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flIn file included from ag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | bar:271:r19: warning: unused variable 'ptr' [-Wunused-variable] i 271 | e uinrt64_t* _ptr = brecvPtyr(0)+ll_128Ogffset; | ^~~ roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ldata2, flag2; | ^~~~~ l128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:/2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:s173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:i15: warning: initializer order does not match the declaration order [-Wreorder-ctor] zeof(T) : 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCLstepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIn_STEdPS/seizeof(Tx) : ,stepSi ze_) {w | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ o| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hr:34k->connIndex);:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(| tid, ^n /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | threads , &rin g->prevr,u &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connnRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work);); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h :65:5:| note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here ^65 | r/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cppunRing (tid,note: nthrin instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested hereeads, work) ; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h12 | DEF:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, suncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, 1, 2, 2>::run' requested here o7 | DEp, algo, proto, FINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCunL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: roll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock'note: 670 | expanded from macro 'DEFINE_ncclDevFunc' tid (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), touip), d| ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sprims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->izeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thrtideadIdx.x), g, nthrreads, woork); u | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hp:432:78: note: (in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | g if r(tid < osubtn) RuunWorkCpoll() .r stepSize(stuen(tid,p subtn,Size_ = work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SI= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | pMriPmLE_Sums_u32_(4, nctclFunicReducdeScat,ter, FuncSnum, utinthreads, &ring->prev, &ring->next, work->sendbuff, w32_ot, NrCCL_AkLGO_R-ING, N>CCL_PrROTO_eSIMPLEc, 4) v | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hb:uff, wor611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670k->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing:(15: note: tfield 'nthreads' will be initialized after field 'tidInBlock' i670 | d tid,(tid ), nnthrteadsh(nthreads), tidInBlock(threadrIdxe.xads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: )in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here, gr oup (group),432 | ^~~~~~~~~~~~~~~~~ | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 670:60: note: field 'group' will be initialized after field 'stepSize' 670 | if ( t tiid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PRO11 warnings generated when compiling for host. TO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | Run/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NWorkCBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthre_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, adsw), tidIonBlockr(threakdIdx.x-), gro>up(grocup),onnIndex); | ^ | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ coll, ty, redop, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subIn file included from tn, wor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tidk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, N)C, nthreads(nthreaCds), tidInBlL_ALGO_RINock(thrGeadIdx.x), group,(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSNizCCL_Pe(stepROSiTO_SIMPLE, ze_ == 40 ? n) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hcclShmem.:611:62: cnote: omm.buexpanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:Batch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr7: note: oin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | u prims(tid, nthreads, &ring->prev, &ring->next, pwork->se)ndbuff, wor,k->recvbuff, w | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(ork->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreck(threadIdx.x), group(group), | ^~~~~~~~~~~ LL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 2, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here In file included from 34 | prims(tid, nthreads, &ring->prev, &ring->next, wo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: rIn file included from k->send/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hbuff,: wor173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: k->recvbuff,warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x) wor,k->redO pArg, 0g, workr->connoIndex,u work-p>connI(ndex);g | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hr:65:5:o note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65u | rpu), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSiznReing(tid,0 nthre ? ncclShmem.comm.buffSizadse, works); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78[: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSi 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prAlgeo, Prvoto,, CO LL_U&NROLrL>(i).runn(tigd, su-bt>n, wnork)e; | x ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cppt:12:1,: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | wDEFIoNE_nrcclDekv->sendbuffFu,nc( ReduwceScork->reattcer_RIvbuff, work->redOpArg, 0, woNG_SrIMPkLE_S-um_u>32_4c, ncoclFuncReducennIndex, woSrcattker, -Func>SuconnIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | rmu, uinRnit32n_t,g NC(tid, nthreads, work); | ^PL E, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h::611432:62:: note: expanded from macro 'DEFINE_ncclDevFunc'78 :611 | note: Runin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested hereWor kBa tch , a lgoi, prfoto , u(nrtoid < subtn) RunWorkColld().Orupn(),; \A | l ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hg:670o:15:, note: field 'nthreads' will be initialized after field 'tidInBlock' 670P | r tiod(ttido, COLL_UNROLL>().run(tid, subtn,), ntwhreoadsr(ntkhre)ads),; t idI nBl| ock ^(th rea/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cppdIdx.x):, 7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINgroup(group), | ^~~~~~~~~~~~~~~~~ E_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_2, ncclFuncRedu/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hc:670e:60S: cnote: field 'group' will be initialized after field 'stepSize' a 670t | t teid(rtid,), nFthrueadns(nthreadsc), StiduInBlock(mthr,ead Iudint32_t, NCCL_ALGO_RING, Nx.Cx), CgroLup(_groPup)R, O| ^~~~~~~~~~~ TO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u32_4, ncclFuncReduceScatter, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from 11 warnings generated when compiling for gfx1201. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ nt w = t| h ^read Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_In file included from by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173st: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ int w = th/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29readIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, fla:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | ug1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32i_ntt32 d_ta dtataa11,, fflalg1a, gda1ta2,, dflaagt2;a 2| ^~~~~, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hlIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a:g1452:21:; warning: unused variable 'flag1' [-Wunused-variable] | ^~~~~145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h uint:32145_t: d35at: warning: unused variable 'flag2' [-Wunused-variable] 145a | 1 , f la g1u,int d3at2_at2, data1, flag1, daflag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, datat2,a f2la,g 2f;l a | g ^~~~~ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from 670 | t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hid(tid),: nthrea173ds(nt: hread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670s:15), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(st: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIepSize_M == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadCIOLL_UNROLdL>().runx(tid, subtn., work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1:x note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here), grou p 7 | DEFIN(grE_oncucpl),De v | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' Fun670c(ReduceS | catter _ RI tid(tid),NG_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hALGO_RING, N:670:15: warning: ntinitializer order does not match the declaration order [-Wreorder-ctor]hreads( nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchtid), nt,hreads(nthr eads), tiadInBlock(tlhreadIdx.gx), grouop(group),, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ p| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_oto ,671 | u nroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t stepSiize(stepSizde_ == (0 ? ncclSthmem.comm.ibuffSizd)es, nthread[NCsCL_PROTO_(nthreads)SIMPLE]/NC,CL t_STEPS/sizeof(TidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ LL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr { | o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hu:34:7:p note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | ) pr,ims(tid, n thread | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_2, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u64_4, ncclFuncReduceScatter, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:w2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h: 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: =/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145: 14: warning: unused variable 'data1' [-Wunused-variable] t 145 | uhint3r2_t data1,e flag1,a dadta2, Iflag2; d | ^~~~~x .x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROroup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArgnthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRingconnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here L>(t432id, nthr | eads, wo rk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCoill().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncc().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RINlDevFunc(RedGuceScat_ter_RINGS_SIMPLE_SIum_u8_4,M ncclFuPncReduceSLcatter,E Fun_cSSumum_u, 8u_i2n,t 8n_ctcl,FuncReduceScatter, FuncSum, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(op, nalgo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nththreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidIck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_2, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]//builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_Sum_u8_4, ncclFuncReduceScatter, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx906. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 11 warnings generated when compiling for gfx1201. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data11 warnings generated when compiling for gfx942. 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11 warnings generated when compiling for gfx908. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | | ^~~~~b /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:a145:21: rwarning: unused variable 'flag1' [-Wunused-variable] 145r | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from t in/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.ht:271:19 w: warning: unused variable 'ptr' [-Wunused-variable] 271 | = threadIdx.x/WAuint64_t* ptr = RPr_eScIvZPEt;r (\0 )+| ^ ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28In file included from : warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 12 warnings generated when compiling for gfx90a. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBloIn file included from ck(threadIdx.x), group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2(: In file included from g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, sub ? ncclShmem.comm.buffStn, worizes[NCCL_PROTO_SIk); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1:MPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432| :78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)432 | if (tid < subtn) RunWorkCo507ll().ruInnBlock(threadIdx.x/WARP(ti_d, subtSn, work)I; | ^ ZE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp):, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE12 :1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nth508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidI: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prROLL>().rev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reducun(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SeScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>(IMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCC).run(); \ | ^ L_ALGO/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduceScatter, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from :29:15: note: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cppexpanded from macro 'barrier_by_group': 29 | const int2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: /WAIn file included from RP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, f1, flalg1, dataa2, flag2g; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:1145:35: warning: unused variable 'flag2' [-Wunused-variable] 145, | uint 32_t datad1a,t af2l,a gf1l,ag2; | ^~~~~ data2, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hlag:145:35: warning: unused variable 'flag2' [-Wunused-variable] 2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrierIn file included from _by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* pIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271t:19: warning: unused variable 'ptr' [-Wunused-variable] r271 | uint64_=t* ptr = r ecvPtr(0r)+ll128Offeset; | ^~~c vPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :145:21: warning: unused variable 'flag1' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ata2, flag2 ^ ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZEIn file included from ), /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2w: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ha:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | r tid(tid),p nthreads(n(threads), ttidInBloicd/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/ 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, worsizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->k->recvrbuff, worke->redOpArgc, 0, work-v>connIndebx, worku->connIndefx); | ^ f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5:, note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65work->redOpArg, 0, work->connIndex, work->con | nrunRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | ndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) R u if (tnid < subWtn) RunWoorkColl().lrun(tid,l subtn<, work)F; n| ^, T/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp, R:e7dOp, Algo, Prot:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here o 7 | DEF,INE_n COLL_cclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] (threadIdx.x), group670( | g r tid(tid), nthreads(oup), n | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ t 671 | stherpeSizaeds), tidInBlock(th(stepSize_ == 0 ? ncclShmemreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSize.comms.buffSi[zes[NCCL_PNROTO_SIMCPLE]/NCCLC_STEPS/siLzeof(T) :_ stepSize_P) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: OTO_SIMPLE]/NCCL_STEPin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here S 34 | / prims(tsid, nthirzeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ea ds, | &ring->pr group(groupev, &rin g->next, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.hwo:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | rk->sendbuff, work->recv prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work-b>uff, worck->redOpoArg, 0, wonrk->connInndex, workI->connIndnex); | ^ d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65e:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRixng(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hnote: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduceScatter, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 2_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const iIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | : warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:14515 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, In file included from d/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | co/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unrolOLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDli>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ v_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | In file included from if (tid < subtn) RunWorkColl().run(ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tidd, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScat(ttid), nethreadsr(nthre,ads), tidInBlFock(uthreadIndx.x),c groupS(groupu), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ P671 | o stepsSize(sttepSizeD_ == 0i ? nv, int8_t, NCcclShmCL_ALGO_RING, NemCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62:.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h note: expanded from macro 'DEFINE_ncclDevFunc' : 611 | RunWorkBatch, algo, proto, unroll34>:7: (note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here )34 | . r priums(tnid, nthre(ads, )&rin;g- \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | >pre v, & ringt->neixt,d work-(>sentdbuid), nthreads(nff, tworhreads), tki->rdecvbuIff, nworkB->reldOpAock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | rtg, i0, dwor(k-t>coninInddex), w,ork ->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | x.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize11 warnings generated when compiling for host. _ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduceScatter, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grouIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from :19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ a2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st;e | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: 670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidexpanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.c/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groomm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->conup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | nI ndex, work->pconnInrdex); i | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:m79:5: note: sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | ( runtRingprev, &ringoLL128, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^( ).ru/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hn(tid, subt:n, w432ork):; | 78 ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp::5:1 : note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested herenote: 5in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here | DEFINE_n cclD evFunc(R432educe | Scat te r_RI NG_L L128 _Sum PostiDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:f (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncS611u:62: mnote: expanded from macro 'DEFINE_ncclDevFunc' P611 | o RusnWorktBatch, algo, proto, unroll>().run(); \ | ^ Div, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), ntIn file included from h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(thr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSieadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduceScatter, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 12 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:In file included from 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2In file included from ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp :f2l: aIn file included from g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h2:;11 : In file included from | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h ^~~~~: 173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h :warning: 145unused variable 'w' [-Wunused-variable]: 21: warning: unused variable 'flag1' [-Wunused-variable] 145 | 75 | u i n t 3b2a_rtr ideart_ab1y,_ gfrloaugp1(,) ;d a t| a ^~~~~~~~~~~~~~~~~~2 , fl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.ha:g292:;15 : | note: ^~~~~expanded from macro 'barrier_by_group' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 :2928 | : warning: unused variable 'data2' [-Wunused-variable] cons t145 | i n uintt 3w2 _=t tdharteaa1d,I dfxl.axg/1W,A RdPa_tSaI2Z,E ;f l\a g 2| ; ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ :7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), In file included from nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cppn, work); | ^:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), n /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMP/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduceScatter, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:79:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 79 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(ReduceScatter_RING_LL128_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 2>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 2>, 2>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreaAdLGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tids), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670(:tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduceScatter, FuncSum/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grPosoup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2, 4>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce_scatter.h:65:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2, 4>, 4>' requested here 65 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(ReduceScatter_RING_SIMPLE_SumPostDiv_u8_4tDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4), ncclFuncReduceScatter, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 1111 warnings generated when compiling for gfx1101. warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:In file included from 174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hwarning: :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:unused variable 'w' [-Wunused-variable]174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75 :7: warning: unused variable 'w' [-Wunused-variable] 75 | 75 barri | er_by_ group( ); | ^~~~~~~~~~~~~~~~~~b a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hrri:29e:r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hu:145:14:i warning: unused variable 'data1' [-Wunused-variable] n145 | t uin3t32_t2 data1,_ flagt1, d ata2, fdlag2;a | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.ht:145:21a: warning: 1,unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:28: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cppunused variable 'data2' [-Wunused-variable]:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1const int w = threadIdx.x/WARP_SIZE; \ | ^ , flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALG/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(O_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hSTEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex:670:15:, warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSi worzk->connInedex); | _ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | = runRi= 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/ng(tid, nCthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_2, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf16_4, ncclFuncReduce, FuncSum, hip_bfloat16, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2;In file included from | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp2:_2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11:: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: 145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hui:80:5: warning: unused variable 'w' [-Wunused-variable]n 80 | t In file included from 32_t d barriera_tbya1, flag1, data2, flag2;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] _group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from e/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:a174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7d: warning: unused variable 'w' [-Wunused-variable] 75 | I badrrier_by_gxroup();. | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hx:29:15: note: expanded from macro 'barrier_by_group' / 29 | WARP_S c2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ onsIt ZiE; nt w = threadIdx.x/WARIn file included from 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1\ | ^ ,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ flag1, data2, flag2; | ^~~~~ P_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uIn file included from int3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:22: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11_: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271t:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | d a ta1, flag1, d auint64_tt* pta2, flagIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; r = rec| vPtr(0)+ ^~~~~ ll/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 128Offset; 145| ^~~ | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2In file included from : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h::75:7: warning: 11unused variable 'w' [-Wunused-variable] 75 | : barrIn file included from i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.her:_by80:5: _gwarning: roupunused variable 'w' [-Wunused-variable] ();80 | | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h barrie:29r:15: note: expanded from macro 'barrier_by_group' _29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:1163 | ru: nRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizesIn file included from [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(tIn file included from hreadId/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:x11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: ./builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: xwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670) | tid(tid, group(group), | ), nthreads(nthreads), tidInBlock(threadId ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nt(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, hreads, &ring->prev, &rinwork->redOpArgg->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRingconnInd COLL_UNROLLex, >(tid, nthreads, work);work->connI | ^ ndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: tid(tid), nthreads(nthreads), tfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] idInBlock(threadIdx.x),506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | g r opurpi(mgsr(otuipd),, nt h| r ^~~~~~~~~~~e ads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : sIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(RtepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ educe_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_2, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ (); \ | ^| /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tihreaddIdx.x),( group(grtoup), | i ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670d:60: note: field 'group' will be initialized after field 'stepSize' )670 | t,id(tid) , nthreands(nthreads), tidIntBlock(threhadIdx.x),r group(egroup), a| ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), ti(tidd, nthreIads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here OLL_UN ROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_nccl33 | prims(tid, nthreads,In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h: &ring->prev, &ri11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: ng->next, work->sendwarning: initializer order does not match the declaration order [-Wreorder-ctor] buff, work->recv 670 | tid(tid), nthrebuads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ DevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_ISIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWoMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorrkColl().run(tid, subtn, wokBrk); | ^ atch, 1, 2, 2>::run' requested here> , algo, pr o7t | oD, EFuInNrEo_lnlc>c(l)D.ervuFnu(n)c;( R\e d u| c ^e _RING_SIMPLE_Su/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hm:_670b:f158:_ 2note: ,field 'nthreads' will be initialized after field 'tidInBlock' n cclFuncR e670d | uc e , tFiudn(ctiSdu)m,, rntchcrl_ebafdsl(onatt8h,re aNdCsCL)_,A tLiGdOI_nRBIlNoGc,k (NtChCrLe_aPdRIOdTxO._xS)I,M PgLrEo,u p2()g r o| u^p ), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h62::670 :note: 60expanded from macro 'DEFINE_ncclDevFunc': note: field 'group' will be initialized after field 'stepSize' 611 | 670 | R u n Wtoirdk(Btaitdc),h ,t iadlIgnoB,l opcrko(ttoh,r euandrIodlxl.>x()),. rgurno(u)p;( g\r o u| p ^) , | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Proto, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_bf8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_bf8_4, ncclFuncReduce, FuncSum, rccl_bfloat8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:177:18: warning: unused variable 'y' [-Wunused-variable] : 77 | In file included from uint32_t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h y, head,: mantiss12a; | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: 77In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77: | 18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, m antissa; | ^ uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t da\ | ^ ta1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+llIn file included from 128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: In file included from note: expanded from macro 'barrier_by_group' 29 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: const int w = threadIdx.x warning: unused variable 'w' [-Wunused-variable] /75 | bWarrier_byA_grouRP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w =In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: .In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flaIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ g1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_groIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hup(:); | ^~~~~~~~~~~~~~~~~~ 75/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:: note: expanded from macro 'barrier_by_group' 729 | co:nst int w = thrwarning: eadIdx.x/unused variable 'w' [-Wunused-variable]WARP_SI ZE;In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 75 | \ | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h ^ In file included from :29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | In file included from ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h::75:7: warning: 11unused variable 'w' [-Wunused-variable] 75 | : baIn file included from rrier_b/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hy_group:(); | 174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE;:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, fla g\ | 1 ^ In file included from , data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hIn file included from :145:28: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2warning: : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11unused variable 'data2' [-Wunused-variable]: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 :14: warning: unused variable 'data1' [-Wunused-variable] 145145 | | uin t32_t data1 , uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:22: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11_: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174t: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145: 14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t daflagt1, daata2, flag2; | 1 ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h,:145:21: flag1, data2, warning: unused variable 'flag1' [-Wunused-variable] f145 | l uaint3g2_t data12, fl;ag 1, d ata2| , ^~~~~flag 2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h| ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145::28: warning: 145unused variable 'data2' [-Wunused-variable] :145 | 21 ui:nt32 _twarning: unused variable 'flag1' [-Wunused-variable] data1, fldata1, flag1, data2, flag2; | ^~~~~ 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] ag 1, data1452, f | lag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h :145:35 : warning: unused variable 'flag2' [-Wunused-variable]u i145 | n uitnt323_t d2ata_1, ftlag1 , dadta2,a flatg2; a | ^~~~~ 1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const intIn file included from w = threadIdx.x/WARP_In file included from SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); ZE; \ | ^ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w In file included from = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ int64_t* ptr = reIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ cvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : steIn file included from pSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE] | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432/NC:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sen/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hdbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^:508 :29: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] :63:5:506 | tid(tid), nthreads(nthrea note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RundWs), woid(tird%WAkRP_SICZE), woarp(tlid/WAlRP_SI().run(tid, subtn, woagTrhreadk((tid%)4)==3;), gr oup(g roup)| , | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ ^ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp stepS:7:1:ize(ncclShmem.comm.buffSizes[N note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, CCL2_PRO)TO_L L128 ]/NC| CL_S^TEPS /siz/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.heof(uint64:_t))611 { :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h62::33:7 : note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested herenote: 33expanded from macro 'DEFINE_ncclDevFunc' | prims(t611id, | nthr eads , &r ing-> prevR, &ruing->nnextW, woork->rskBatch, algo, proto, unrendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->conoll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | nIn dex ); t | ^i /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hd:77(:5:t note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128, 2>' requested herei d77 | ) ,run Rinngr(tied, anthdreasds,) wo,rk); | t ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hid:432:I78: nnote: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here B 432 | l o c ikf ((tid t< shubtn) rRuneWorakColdl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPSIn file included from /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp| group(group :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:233:7:: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here In file included from 33 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h : pr11ims(: tidIn file included from , nt/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hhread:s, 173&ring: ->pr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hev, :&rin670g->n:ext,15 wor: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkCollsenrdbuffe, woark->drecvIbuffd, woxrk->.redOxpArg,) 0, ,work- >congnIndrex,oup(group), work->connIndex); | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STAElgo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gPS/sizeof(T) : stepSize_) { | roup(group), | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here| 63 | group(group ru/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hnRing(t id,note: nthin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested herereads , wo rk); | ^ 33/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h: | 432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < spubtnr) RuinWormkCosllprev, &ring->nextroto, COLL_UNROLL>().run(tid, subtn, work); , work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp63:7: | 1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEF INE _nccrlDevuFuncn(RedRuce_iRINGn_SIMgPLE_S(ticSudm, ,half , NnCCLt_ALhGO_rRINeG, aNCCdL_PsROTO,_SI MPLwE, o2) r | ^k /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h):611:62;: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | | ^ Ru nWo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hrkBatc:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid h, algo, prot< subtn) RunWorkColl().run(tid, subtn, work); | ^ o,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp unroll>:().7run:();1 \ : | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hnote: :670:15:in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | t7id( | tiDd),E ntFhreIadsN(ntEhre_adsn), tidcInBlcockl(thDreevFunc(ReaddIudcxe.x_),R gIroNupG(g_rSouIp)M, P| ^~~~~~~~~~~~~~~~~L /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hE:_670:60S: unote: field 'group' will be initialized after field 'stepSize'm _670f | 1 6 _ti2d(,ti d)n, cntchlFuncReduce, FuncSum, half, NCCL_ALGO_RING,reads(nthreads), tidInBlock(threadIdx.x), group(grou NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); p\), | ^~~~~~~~~~~| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(t:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_2, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuf/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hf, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] s), tidI 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f16.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f16_4, ncclFuncReduce, FuncSum, half, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1030. 12 warnings generated when compiling for gfx90a. [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_ utint32_t y , head, manytissa; | ^ , head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data 2uint64_t,* ptr = recvPtr(f0)+ll12l8Offseta; | ^~~ g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const int w = threadIdx.xIn file included from /WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: In file included from unused variable 'w' [-Wunused-variable] 75 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7 : warning: unused variable 'w' [-Wunused-variable] 75 | bar rier_by_g roup(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:b note: expanded from macro 'barrier_by_group' 29 | a const irnt w = thrreadIdx.xi/WARP_SIZeEIn file included from r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ In file included from | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable]/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: 174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75145 | bar | rier_by_gro up uint32_t d(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | In file included from uint3/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:22: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:_t data1, flag1, data2, flag2; | ^~~~~ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ata1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_groupIn file included from (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = tIn file included from hreadIdx/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: .In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: xIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: //builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5W: warning: unused variable 'w' [-Wunused-variable] A 80 | R barrPier_b_y_groSup(IZE; \ | ^ ); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:1111: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:: 175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:271::19: warning: unused variable 'ptr' [-Wunused-variable]174 271: | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h u:int6754_:t* pt7r =: recvP tr(0)warning: +ll12unused variable 'w' [-Wunused-variable]8Offs et; | ^~~ 75 | In file included from barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp; | ^~~: 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_ by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | c uint64_tonst int w = threadIdx.x/WARP_SIZE; \ | ^ * ptr = recvPtr(0)+ll128Offset; | ^~~ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uinIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ t32_t data1,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flagIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | constIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm 506 | . tid(tid), bnthreads(ntuhreads)f, wid(tid%fWARP_SIZE), Swarp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3In file included from ),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2 : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173g: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] r 670 | tid(otid), nthrueads(nthrepads), tidIn(Block(thrgeadIdx.x),r group(grouop), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | u tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | steppSize(ste), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | pSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->conn warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | I stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next,O_S IMPLE]w/NCCL_oSTEPSr/sizeokf(T) : -stepS>ize_) s{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ e| group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hn:33:7: dnote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33b | uprims(ftid, nfthread,s, &rin g->pwrev, &oring->rnext, wkork->s-endbuff>, workr->recvebuff,ncd ex, wvowrk-b>connIundexo)f; | ^rf /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:k63,:5:- note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here >63 | rrunRinegconnIndex, worwork->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ k_UN-ROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl>connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn,().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' wor k); 611| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp | : 7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DERFINuE_nncclWDevoFunrc(RkeduBce_aRINtG_ScIMPhLE_S,62 al:go, prnote: otoexpanded from macro 'DEFINE_ncclDevFunc', u nr oll>(611).r | un( ); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :R670:15u: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | n W tiod(trid)k, nBthraeads(ntthreacds)h, t,ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threa algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grdIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid stepSize(stepSi(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group z/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.he_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670== 0 | ? ncclShmem.comm.buff Sizes[NCCL_ PROTO_SIMPL E]/NCCL_ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPtS/sizeIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: of(T) : stepSwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ ize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, w/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCC/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] L_STEPS/sizeof(T) 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7ork->In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidredOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | i: Inote: nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here f 33 | prims((tid, ntidthre runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ a ds, &rinpr ev, &risng->neuxt, work->sendbufbf, work-t>recvbunff, work)->redOp Arg, 0,R work->unWorkCollconnIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCC_UNLROLL>_(tid,A nthrLeads,G work); | O ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:_432:78: Rnote: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432I | N if (Gtid < ,subtn) RunWNorkCoCllO()_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:.run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMP611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), LE_Sunm_f32t_4, nhcclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | snthrteads(enthrepads),S tidIinBlockz(three(adIdsx.x),t groupe(groupp), S| ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hi:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_2, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_S/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIMP:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | LE, 4 ) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ht:611:62: inote: expanded from macro 'DEFINE_ncclDevFunc' 611 | d Ru(tidnWo)r, nthkBatch, algo, protoreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(s, unrtoll>()e.run()p; \ | S ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670i:15: note: field 'nthreads' will be initialized after field 'tidInBlock' z 670 | e tid(_tid), nthrea=ds(= 0 ? ncclShmem.comm.buffSizes[NCntChreadsL), tidI_nBlockP(threaRdIdx.xO), grTO_SIMPLE]/oNup(groCup), C| L_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:60: | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recouvp(gbroup)u, | f ^~~~~~~~~~~ f, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f32_4, ncclFuncReduce, FuncSum, float, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hwork->recvbuff, w:670:ork->15: warning: initializer order does not match the declaration order [-Wreorder-ctor]redOpAr 670g | , 0, work tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_->conn Index, w ork->connIndex);671 | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h | :63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(ti/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hd, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, 15: warning: initializer order does not match the declaration order [-Wreorder-ctor]& 670 | trid(tidi), nnthreadgs(nth-reads>), tipdInBlrock(tehreadvIdx,.x), groLL&u_UNRrpOLL>(i().runng(tigdr, s-uobtn>,u wonrpk);e ) | ^ x,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12t :1: , note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here work->sendb| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ f671 | f s12 | DEFtINE_encclDpevFunSc(Rediuce_RzING_SeI(stepSMPiLE_Suzm_, fework_3->re == 0 ? ncclShmem.comm.bu2f_4, fncclFSuncRizes[NCCL_PReduOce, TFuncSOum, _floaSt, NICCL_ALGO_RIMNGc,Pvbu Lff,NE woC]rk-C/>rLeNdO_pACrPg,C R0,L Owor_Tk->cOSonnIT_ndeESx, PIworSMk->/PconsLnIniEdexz,);e | ^o 4/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hf:)63:(5 : Tnote: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here ) | 63 | ^ : ru /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hnRsite:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611n | g(tid,11 warnings generated when compiling for host. ntpSize_)h { | r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | e group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.ha:33:7d: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested heres 33, | prims(work)t;id, nt hrea| ds,a ^ tch &>,78 p:alrg o,e note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here v432, &r | ipro nto, g un -rol >l>( n).r eun()ix; \ft (ti, wdork -| ^< /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h >:s670:s15e: unote: nfield 'nthreads' will be initialized after field 'tidInBlock' btn) dbuff, work->recvbuff, work->redOpArg, 0, 670 | w tido(tidr), nkthre-ads(>nthrceadso), tnidInnBlocIk(thnreaddIdxRe.unWxxork,)C work->connIndex); | ^, grou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hp(group):, | 63 ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h::670:60:5 note: ofield 'group' will be initialized after field 'stepSize':ll, ProtoSimple<1, 1, 4>, 4>' requested here Re670 dOp | , A lgo63 , P | rot o, C tOLL i_UN dROLr(L>(ut).riun(tdnid,) subtn, work);, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_RnincgE(ltoci_kdS(,ut mhn_rtfeh3ar2de_Ia4dd,xs .,nx c)wc,ol rFgkur)no;cu Rp e(| dg ^u ro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hceu,: 432pF:)u78,n: c Snote: uin instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here| m , ^~~~~~~~~~~ float, NCCL_ALGO_RING, NCCL432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_nccRunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tidlD(evtFuincd(R)ed,uc e_nRItNGh_SrIMePLaE_dSusm_(f3n2_t4,h nrccelFaundcRsed)u,ce , tFuincdSuIm,nBlock(threadIdx.x) ,flo agt,r NoCCLu_ApLG(O_gRrINoG,u NpCC)L_,PR OT O_| SI ^~~~~~~~~~~~~~~~~MPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch ,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: 670:a60:l note: gfield 'group' will be initialized after field 'stepSize' o 670, | p trido(ttido),, nt hrueandsr(nothlrela>ds)(, )ti.drIunBnlo(ck)(t;hr ea\dI dx .x)| , ^gr o/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hup(group), | ^~~~~~~~~~~ :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ :15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: In file included from note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ eadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | unused variable 'flag1' [-Wunused-variable] 145 | uint32 _t datab1, flaga1, datar2rier_by_group(), fla;g2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: | warning: ^~~~~~~~~~~~~~~~~~ unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp2; | ^~~~~: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:235: warning: unused variable 'flag2' [-Wunused-variable] : 145 | In file included from uin/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.ht32_t :data1,11 flag1: , dataIn file included from 2, fl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.hag2; :| ^~~~~ 174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t daIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ta1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | consIn file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:t int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.hhre:adIdx.145x/WARP:_SIZ14: warning: unused variable 'data1' [-Wunused-variable] E;145 \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uin11t: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, dIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | baratra2, flaig2; e| ^~~~~ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const intIn file included from w = threadI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | dx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h tid(tid):670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSu m | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cppc:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: kIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15(: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | t tid(tihd), nthrreadsea(ntdIdx.x), grhreads), otidup(groInBlouckp(), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ threadIdx671.x), g | stepSize(stepSize_ == 0 r?oup(grou p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n| tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_cclShm 671 | e stepSimze(.csteommp.SizbuffSizes[NCCLe_ == 0_ ? ncclShPROTO_SIMPLE]/mem.comm.NbuffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, iwzeof(T) ork->sendb: ustepSifze_) { f | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h :33:w7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here o33 | prims(tird, nthrkeads, -&ring->>prev,r &ringe->nextc, work-v>buff, work->redOsenpdbuff,A work-r>recvbg,uf 0, work->connIndex, work->connIndex)f, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl(ti,d, nt hreaPds, rwork);o | ^ t/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:o78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here, 432 | C if (Otid ().run(tid, subtn,oto, COLL_ UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidI/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSiznBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWoIn file included from rkColl().run(tid, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hsubtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RI:11N: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:G670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] _ 670 | SIMPLE_Sum_f64_2, ncclFunc tRid(tide), ntdhreadus(nthrceadse), t, FuncSum, doidInBulock(bthrealdIdx.ex), g,roup( group)N, | C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | C tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671L | _stepALGO_RING, NCCSizeL(step_Size_P ==ROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 0 | ? ncc lShme m.comm .buff SizeRs[NCCuL_PROnTO_SWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthr estepSaize_) d{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s | ) group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h,:33:7 : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here t 33 | i dprims(Itid, nnthreBads, &ringl->preov, &rcing->nkext, (work->tsendbhuff, readworIdx.x), grok->reucvbupff(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670, :work-60>red:OpAr g, 0,note: wofield 'group' will be initialized after field 'stepSize' 670 | tid(tirkd->con)nInde,x, wor k->cnthreadsonn(Indexn)threads), tidIn; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | Block(threadIdx.x), group(group), | ^~~~~~~~~~~ runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] subtn, 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_2, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f64_4, ncclFuncReduce, FuncSum, double, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 12 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h: In file included from :/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from 14/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h: :77:18:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h warning: unused variable 'y' [-Wunused-variable] :77 | uint32_t y, head, mantissa; | ^ 77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | In file included from uint32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2 : In file included from | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h ^~~~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2:: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h14: warning: :unused variable 'data1' [-Wunused-variable] 14511: | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5 : warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdxuint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ .x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from const int w = thread/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ Idx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: In file included from warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hIn file included from :670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_2, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_f8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_f8_4, ncclFuncReduce, FuncSum, rccl_float8, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ Idx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrierIn file included from _by_gro/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2u: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ (); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const inIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ nt32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, In file included from flag1, data2, flag/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from 2/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h: :145:In file included from 14: warning: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hunused variable 'data1' [-Wunused-variable] 145 | : uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* pIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ tr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nc:cl33S:h7m:e mnote: .cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested hereo mm.buffSizes[NCCL_ P33R | O T O _ S I MpPrLiEm]s/(NtCiCdL,_ SnTtEhPrSe/asdisz,e of&(rTi)n g:- >sptreepvS,i z&er_i)n g{- > n| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x t ,| group(groupw ork->sendbuff, work->recv/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hb:u33f:f7,: wnote: oin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested herer k->redOpAr g33, | 0 , w o rpkr-i>msc(otnindIn, dnetxh,r ewaodrsk,- >&croinnngI-n>dperxe)v;, &| r ^i ng->next, work->send/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.hb:u63f:f5,: wnote: oin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested herer k->re c63v | b u f f ,r uwnoRrikn-g>CcOoLnLn_IUnNdReOxL,L >w(otrikd-,> cnotnhnrIenaddesx,) ;w o r| k ^) ; | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h :63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :r432u:n78Ri:n note: gin instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here< T, RedOp, 432P | r o t o , CiOLfL (_tUiNdR Osu(bttind), RnuthnrWoerakdCs,o llw, 1, 2, 2>::run' requested here, COLL_U N432R | O L L > ( ) .irfu n((ttiidd ,< ssuubbttnn,) wRournkW)o;r k C| o ^l l, 1, 2, 2>::run' requested herel go, Prot o7, | CDOELFLI_NUEN_nRcOcLlLD>e(v)F.urnucn((Rteidd,u ces_uRbItNnG, _wSIoMrPkL)E;_ S| u ^m _u32_2, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cppn:c7c:l1F:u nnote: cin instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereR educe, F u7n | cDSEuFmI,N Eu_innctc3l2D_etv,Fu nNcC(CRLe_dAuLcGeO__RRIINNGG_,S INMCPCLLE__PSRuOmT_Ou_3S2I_M2P,L En,c c2l)F u n| c^R educe, F/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.hu:n611c:S62u:m ,note: expanded from macro 'DEFINE_ncclDevFunc'u int32_ t611, | N C C LR_uAnLWGoOr_kRBIaNtGc,h 2,) a l| g^o , prot/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.ho:,611 :u62n:r onote: lexpanded from macro 'DEFINE_ncclDevFunc'l >().r u611n | ( ) ; \ R u| n ^W orkBatch ,670 | a l go , t ipdro(ttiod,) u,n rnotlhlr>e(a)d.sr(unnt(h)r;e a\d s )| , ^ tidInBl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.ho:c670k:(15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a dIdx.x) ,670 | g r o u pt(igdr(toiudp)),, n t| h ^~~~~~~~~~~~~~~~~r ead/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hs:(670n:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd s), ti d670I | nB l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrroeaudps)),, t| ^~~~~~~~~~~~~~~~~i dInB/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hl:o670c:k60(:t note: hfield 'group' will be initialized after field 'stepSize'r eadIdx .670x | ) , g rtoiud(pt(igdr)o,u pn)t,h r e| ^~~~~~~~~~~ad s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring-:670:15: warning: >initializer order does not match the declaration order [-Wreorder-ctor] 670 | p tid(rtid), ntehreadsv(nthre,ads), t idInBloc&k(threadIdx.x)r, groupi(group),n | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_g ->next, work->sendbuff, work 671 | s-tepSiz>e(steprSize_ ==e cvbuff, work->redOp0 ? ncclShmem.comm.buffSizes[NCCL_PROTArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (O_SIMPLtE]/NCCL_iSTEPS/sidzeof(T) : stepSi, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here b 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(thretn) RaunWorkdColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSizdx.x), group(group), | Arg, 0, work->connIndex, work->connIndex); | ^e_ == 0 /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here RedOp, 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | st epSize(st epSize_ == 0 ?i ncclShmfem.co mm.buffSizes[NCCL_(PROtid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunW orRkeCdoOlpl,< FAnl,g oT,, PRreodtOop,, CAOlLgLo_,U NPRrOoLtLo>,( )C.OrLuLn_(UtNiRdO,L Ls>u(b)t.nr,u nw(otrikd),; s u| b ^t n, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp12::112:: 1note: :in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFI N12E | _DnEcFcIlNDEe_vnFcucnlcD(eRveFduuncce(_RReIdNuGc_eS_IRMIPNLGE__SSIuMmP_LuE3_2S_u4m,_ un3c2c_l4F,u nnccRceldFuucnec,R eFduuncceS,u mF,u nuciSnutm3,2 _uti,n tN3C2C_Lt_,A LNGCOC_LR_IANLGG,O _NRCICNLG_,P RNOCTCOL__SPIRMOPTLOE_,S I4M)P L E| ,^ 4) | ^/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :611:62: note: expanded from macro 'DEFINE_ncclDevFunc'/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h :611:62: note: 611expanded from macro 'DEFINE_ncclDevFunc' | Ru n611W | o r k B aRtcuhn, ,r ealdgopo<,t yp>r,o taol,g ou,n rporlolt>o(,) .urnurno(l)l;> (\) . r| u ^n (); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: 670field 'nthreads' will be initialized after field 'tidInBlock' | tid(t i670d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~( grou/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hp:)670,: 60 :| ^~~~~~~~~~~~~~~~~note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670: 60670: | note: field 'group' will be initialized after field 'stepSize' tid(t i670d | ) , n tthirde(atdisd()n,t hnrtehardeasd)s, (tntihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)p(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_2, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u32_4, ncclFuncReduce, FuncSum, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1200. 1111 warnings generated when compiling for gfx1102. warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | u/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ int32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int wIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = thre =a threadIddx.x/WARP_SIIZE; \ | d ^ x.x/WAIn file included from RP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128OffseIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_grou 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ p(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | t; c | ^~~ onst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, In file included from flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, f/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ lag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = reIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] c vPtr(0)+ll12758Offset | ; | ^~~ barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ onst int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21f: warning: unused variable 'flag1' [-Wunused-variable] 145 | l uinta32_t datag1, fla2g1, data2;, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145 | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, fla:28: gwarning: unused variable 'data2' [-Wunused-variable] 1145 | ,uint32 _t datda1, flaag1, dtata2, aflag2; 2 | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h,:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | f uint3l2_t daata1, flgag1, data2, flag2; | ^~~~~ 2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barri| ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ er_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr =/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp :2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:r11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175e: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ cvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ adIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthrIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(neads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connInde/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] x); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63: 5670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, Proto, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? nccl 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.cIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2dInBlock(threaomm.buffSdizes[NCCIL_PROTO_dSIMPLEx]/NCCL_S.TEPS/sizeof(T) : xstepSize)_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ , | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h :33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here g 33 | r prims(toid,, nccl FuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthureads,p(group), &| ring->p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~rev, | & tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | rings->next,tepSize(st work->esendpSize_ ==bu ff, work0- ? ncc>recvbuff, work->redOpArg, lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_2, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid)/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h, nthreads(:n670:15thre: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work-ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]>/connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u64_4, ncclFuncReduce, FuncSum, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h::14: warning: 271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data= recvPtr(0)+ll128Offset; | ^~~ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ ZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunW/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.horkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads):508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP, _tidInSBlockI(threZadIdEx.x), )grou,p(grou p), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h| :670:60: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' | warp(tid/WARP_SIZE 508 | flagThread670 | ( (tid(ttid)i, nthdread%s(nt4hrea)ds),= ti=dInBl3ock()thr,eadI dx.xg), grroup(ogrouup), p | ^~~~~~~~~~~ (group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tiIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), nthread 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &rIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDing->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->ceovFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nnIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_2, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sum_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_Sum_u8_4, ncclFuncReduce, FuncSum, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1201. 12 warnings generated when compiling for gfx90a. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1102. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ 12 warnings generated when compiling for gfx90a. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cppIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPt:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ r(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: :In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: 2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:In file included from 271:19/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWo/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL128, COLL_UNROLL>(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_2, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i32_4, ncclFuncReduce, FuncSumPostDiv, int32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ : In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp: 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_2, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i64_4, ncclFuncReduce, FuncSumPostDiv, int64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx1100. 12 warnings generated when compiling for gfx90a. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1102. [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.xIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ roup(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads)15, :t iwarning: dIinitializer order does not match the declaration order [-Wreorder-ctor]n Block(threadIdx.x), gr o670u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_2, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_i8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_i8_4, ncclFuncReduce, FuncSumPostDiv, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174In file included from : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | conIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 29 | st int w = threadIdx.x/WARP_SIZE; \ | ^ const intIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174 w = threadIdx.x/WARP_SIZE; \ | ^ : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRcomm.buffiSng(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work)izes[NCC; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | priroup), | ^~~~~~~~~~~ ms(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hS:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthr/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatchads(nt,hreads ), tidaInBlgo, proto, unroll>().run()loc;k(thre adIdx.\x), group (gro| u ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670p), | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | s tid(ttepSize(stepSize_ == 0 ? id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_2, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u32.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u32_4, ncclFuncReduce, FuncSumPostDiv, uint32_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx908. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 12 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1200. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, datIn file included from a/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ , data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_2, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | p/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nrims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u64.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u64_4, ncclFuncReduce, FuncSumPostDiv, uint64_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]kBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:508:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 506 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 507 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 508 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 509 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:77:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128, 2>' requested here 77 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 1, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:5:1: note: in instantiation of member function 'RunWorkBatch, 1, 1, 2>::run' requested here 5 | DEFINE_ncclDevFunc(Reduce_RING_LL128_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_LL128, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkCollprev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, COLL_UNROLL>().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for host. In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 2>, 2>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 2>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:7:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 7 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_2, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1, 4>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, work->sendbuff, work->recvbuff, work->redOpArg, 0, work->connIndex, work->connIndex); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/reduce.h:63:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1, 4>, 4>' requested here 63 | runRing(tid, nthreads, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:432:78: note: in instantiation of member function 'RunWorkColl, 1, 2, 4>::run' requested here 432 | if (tid < subtn) RunWorkColl().run(tid, subtn, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/reduce_sumpostdiv_u8.cpp:12:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 12 | DEFINE_ncclDevFunc(Reduce_RING_SIMPLE_SumPostDiv_u8_4, ncclFuncReduce, FuncSumPostDiv, uint8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 12 warnings generated when compiling for gfx90a. [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o -MF CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o.d -o CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1201. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx1200. 11 warnings generated when compiling for gfx908. 12 warnings generated when compiling for gfx90a. [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/hipcc -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_CONTIGUOUS_MEMORY -DHIP_UNCACHED_MEMORY -DNVTX_DISABLE -DNVTX_NO_IMPL -DROCM_VERSION=60401 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -DUSE_ROCM_SMI_THREAD_ONLY_MUTEX -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/network/unpack -I/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -std=c++14 -fPIC -parallel-jobs=12 -Werror=uninitialized -Werror=sometimes-uninitialized -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -x hip --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/git_version.cpp In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:1: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:12: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/collectives.h:15: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/device.h:14: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/include/rccl_float8.h:77:18: warning: unused variable 'y' [-Wunused-variable] 77 | uint32_t y, head, mantissa; | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:7: warning: unused variable 'w' [-Wunused-variable] 7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h: | const int w = threadIdx.x/WARP_SIZE; \ | ^ 173/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:75:In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1,:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:117:: warning: unused variable 'w' [-Wunused-variable] In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:17575 | : /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: barrie 75 | dat a2, flag2; barrie | ^~~~~ r_by_group/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint3(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ 2_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ r_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:75:7: warning: unused variable 'w' [-Wunused-variable] 75 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:174: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:14: warning: unused variable 'data1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:21: warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ : warning: unused variable 'flag1' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:28: warning: unused variable 'data2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll.h:145:35: warning: unused variable 'flag2' [-Wunused-variable] 145 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:80:5: warning: unused variable 'w' [-Wunused-variable] 80 | barrier_by_group(); | ^~~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:29:15: note: expanded from macro 'barrier_by_group' 29 | const int w = threadIdx.x/WARP_SIZE; \ | ^ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:175: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:257:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 257 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:259:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 259 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:269:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 269 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(steIn file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), pSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid),/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hple<1,1,COL:670:L_UN15ROL: warning: initializer order does not match the declaration order [-Wreorder-ctor]L>>(sub t 670id | tid(tid), ,n subtn, grtoup,h readswork()nthre; | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:In file included from ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp == 0 ? :ncclShmem2.c1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested hereo: 3 | DEFIn file included from mINE_ncmcl/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.hDevFu nthread.s(nthreads), tidInBlncock(threadIdx.x), group(group), | ^~~~~~~~~~~ :(SebndR11ecuv_RING_S: IMPLEIn file included from _fSum_i8_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h2f, ncSclF:uncSend173Recv, Fu: incSum,/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670ze | s[NCCL_ PROTO_S iInt8_t, MNCCL_APLG O_RING,L NCCEL_PRO ]/TtNiCOd_(tid), nthreaCL_STEPS/sizeof(T) : stepSIMPLE,S 2) | ^ i/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc'z e_)611 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h RunWorkBatch, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here e 103 | ofcoll, ty, redop, algo, proto, unroll>().run(); \ | ^ (T) /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' : 670 | tid(tid), nth stereads(nthreads), tidInBlock(threadIdx.xpSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ )| , group(group group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h: 45670: | 7 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested heret id(tid), 45n | t h r e a d sp(rnitmhsr(etaidds,) ,t nt,i dnIunlBllpoctkr(,t &hwroerakd-Id>xse.nxd)R,a ngkr,o uwpo(rgkro-u>pse)n,d A d| d ^~~~~~~~~~~ r, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:271:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 271 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:2: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:11: In file included from /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/primitives.h:173: /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:259:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runSend>' requested here 259 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALG/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().:670run();:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h670: | tid(ti670:15: note: dfield 'nthreads' will be initialized after field 'tidInBlock' 670 | ) , ntthirdeads(nthreads), tidInBloc(tid), nthreads(nthreads), tidInBlock(threk(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: ? nc warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | clShmem.co tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCadCIdx.x), Lgroup(gr_oup), | P ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60R: note: field 'group' will be initialized after field 'stepSize' 670O | T tid(tiOd), _nthreaSds(nthreIads), tidInMBlock(thPreaLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:dIdx103.x), gr:oup(gr7oup), | : ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, wor/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670k:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] ); | ^ 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DE:670:15: ncclDevFunc(SendRecv_RING_SIMPLE_Sum_iwarning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | 8_2, ncclFuncSendRecv, Fun tid(tid)cSum, int8_t, NC, nthreadCL_ALGO_RING, NCCL_PROs(nthreadsTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLEFINE_]ncclDev/Func(SenNdRecv_RICNG_SIMPLCE_Sum_iL8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRan/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.hk, :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9:, u nroll>().runote: n(); \ in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here| ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h :670:15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 2>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:271:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 2>::runRecv>' requested here 271 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:3:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 2>::run' requested here 3 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_2, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 2) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ :670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthre 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:45:7: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here ads), tidInBlock(threadIdx.45 | prims(tid, tn, nullptr, &work->sendRank, work->sendAddr, nullptr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:261:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runSend>' requested here 261 | runSend>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611_ == 0 ? nc:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | clShmem.RunWorkBcomm.buffSizesatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670[ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_ 671 | stepSize(stepSize_ == 0 ? ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) : stepSize_) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:103:7: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 103 | prims(tid, tn, &work->recvRank, nullptr, nullptr, work->recvAddr, | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/sendrecv.h:273:9: note: in instantiation of function template specialization 'RunWorkBatch, 1, 2, 4>::runRecv>' requested here 273 | runRecv>(subtid, subtn, group, work); | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/gensrc/sendrecv_sum_i8.cpp:4:1: note: in instantiation of member function 'RunWorkBatch, 1, 2, 4>::run' requested here 4 | DEFINE_ncclDevFunc(SendRecv_RING_SIMPLE_Sum_i8_4, ncclFuncSendRecv, FuncSum, int8_t, NCCL_ALGO_RING, NCCL_PROTO_SIMPLE, 4) | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/common.h:611:62: note: expanded from macro 'DEFINE_ncclDevFunc' 611 | RunWorkBatch, algo, proto, unroll>().run(); \ | ^ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/hipify/src/device/prims_simple.h:670:60: note: field 'group' will be initialized after field 'stepSize' 670 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1200. 13 warnings generated when compiling for gfx1201. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1030. [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/hipcc -fPIC -O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -O2 -g -DNDEBUG -parallel-jobs=1 -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -Xlinker --dependency-file=CMakeFiles/rccl.dir/link.d -Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o CMakeFiles/rccl.dir/hipify/src/init_nvtx.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/register.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/device/common.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/device/onerank.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/misc/alt_rsmi.cc.o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/hipify/src/misc/api_trace.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/roctx.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/tuner.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/generic.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_tmp.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/gensrc/all_gather_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/all_reduce_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/alltoall_pivot_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/broadcast_sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/device_table.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/host_table.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_MinMax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/msccl_kernel_Sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_minmax_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_premulsum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_prod_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/redclang++: warning: argument unused during compilation: '-Xarch_host -fstack-protector-strong' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-Xarch_host -fcf-protection' [-Wunused-command-line-argument] clang++: warning: argument unused during compilation: '-specs=/usr/lib/rpm/redhat/redhat-package-notes' [-Wunused-command-line-argument] Elapsed time (seconds): 6476.18 uce_scatter_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_scatter_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_bf8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f16.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_f8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sum_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_i8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u32.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u64.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/reduce_sumpostdiv_u8.cpp.o CMakeFiles/rccl.dir/hipify/gensrc/sendrecv_sum_i8.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.4.43483 --hip-link --offload-arch=gfx906 --offload-arch=gfx908 --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 --offload-arch=gfx1200 --offload-arch=gfx1201 /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' [100%] Built target rccl gmake[1]: Leaving directory '/builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build' /usr/bin/cmake -E cmake_progress_start /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/redhat-linux-build/CMakeFiles 0 + RPM_EC=0 ++ jobs -p + exit 0 Executing(%install): /bin/sh -e /var/tmp/rpm-tmp.AFqZSA + umask 022 + cd /builddir/build/BUILD/rccl-6.4.1-build + '[' /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT '!=' / ']' + rm -rf /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT ++ dirname /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT + mkdir -p /builddir/build/BUILD/rccl-6.4.1-build + mkdir /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT + CFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer ' + export CFLAGS + CXXFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Werror=format-security -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -Xarch_host -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -Xarch_host -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer' + export CXXFLAGS + FFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FFLAGS + FCFLAGS='-O2 -fexceptions -g -grecord-gcc-switches -pipe -Wall -Wp,-U_FORTIFY_SOURCE,-D_FORTIFY_SOURCE=3 -Wp,-D_GLIBCXX_ASSERTIONS --config /usr/lib/rpm/redhat/redhat-hardened-clang.cfg -fstack-protector-strong -m64 -march=x86-64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection -fno-omit-frame-pointer -mno-omit-leaf-frame-pointer -I/usr/lib64/gfortran/modules ' + export FCFLAGS + VALAFLAGS=-g + export VALAFLAGS + RUSTFLAGS='-Copt-level=3 -Cdebuginfo=2 -Ccodegen-units=1 -Cstrip=none -Cforce-frame-pointers=yes -Clink-arg=-specs=/usr/lib/rpm/redhat/redhat-package-notes --cap-lints=warn' + export RUSTFLAGS + LDFLAGS='-Wl,-z,relro -Wl,--as-needed -Wl,-z,pack-relative-relocs -Wl,-z,now -Wl,-z,now -Wl,--build-id=sha1 -specs=/usr/lib/rpm/redhat/redhat-package-notes ' + export LDFLAGS + LT_SYS_LIBRARY_PATH=/usr/lib64: + export LT_SYS_LIBRARY_PATH + CC=hipcc + export CC + CXX=hipcc + export CXX + cd rccl-rocm-6.4.1 + DESTDIR=/builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT + /usr/bin/cmake --install redhat-linux-build -- Install configuration: "RelWithDebInfo" -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64/librccl.so.1.0 -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64/librccl.so.1 -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64/librccl.so -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/include/rccl/rccl.h -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/include/rccl/nccl_net.h -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/include/rccl/amd_detail/api_trace.h -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb-op.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb-op.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple-op.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple_2.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-0-9kb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-190kb-512kb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-512kb-7mb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-7mb-43mb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-algorithms/alltoall-8n-9kb-190kb.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-targets-relwithdebinfo.cmake -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64/cmake/rccl/rccl-config-version.cmake -- Installing: /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt + echo s@/builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT@@ + find /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64 -name '*.so.*.[0-9]' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64 -name '*.so.[0-9]' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64 -name '*.so' + sed -f br.sed + find /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/lib64 -name '*.cmake' + sed -f br.sed + '[' -f /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt ']' + rm /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/doc/rccl/LICENSE.txt + /usr/bin/find-debuginfo -j2 --strict-build-id -m -i --build-id-seed 6.4.1-3.fc43 --unique-debug-suffix -6.4.1-3.fc43.x86_64 --unique-debug-src-base rccl-6.4.1-3.fc43.x86_64 --run-dwz --dwz-low-mem-die-limit 10000000 --dwz-max-die-limit 110000000 -S debugsourcefiles.list /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1 find-debuginfo: starting Extracting debug info from 1 files DWARF-compressing 1 files dwz: ./usr/lib64/librccl.so.1.0-6.4.1-3.fc43.x86_64.debug: Unknown debugging section .debug_str_offsets sepdebugcrcfix: Updated 0 CRC32s, 1 CRC32s did match. Creating .debug symlinks for symlinks to ELF files Copying sources found by 'debugedit -l' to /usr/src/debug/rccl-6.4.1-3.fc43.x86_64 find-debuginfo: done + /usr/lib/rpm/check-buildroot + /usr/lib/rpm/redhat/brp-ldconfig + /usr/lib/rpm/brp-compress + /usr/lib/rpm/redhat/brp-strip-lto /usr/bin/strip + /usr/lib/rpm/brp-strip-static-archive /usr/bin/strip + /usr/lib/rpm/check-rpaths + /usr/lib/rpm/redhat/brp-mangle-shebangs + /usr/lib/rpm/brp-remove-la-files + /usr/lib/rpm/redhat/brp-python-rpm-in-distinfo + env /usr/lib/rpm/redhat/brp-python-bytecompile '' 1 0 -j2 + /usr/lib/rpm/redhat/brp-python-hardlink + /usr/bin/add-determinism --brp -j2 /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT Scanned 38 directories and 314 files, processed 0 inodes, 0 modified (0 replaced + 0 rewritten), 0 unsupported format, 0 errors Reading /builddir/build/BUILD/rccl-6.4.1-build/SPECPARTS/rpm-debuginfo.specpart Processing files: rccl-6.4.1-3.fc43.x86_64 Executing(%license): /bin/sh -e /var/tmp/rpm-tmp.AuHDLU + umask 022 + cd /builddir/build/BUILD/rccl-6.4.1-build + cd rccl-rocm-6.4.1 + LICENSEDIR=/builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/licenses/rccl + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export LICENSEDIR + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/licenses/rccl + cp -pr /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/LICENSE.txt /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/licenses/rccl + RPM_EC=0 ++ jobs -p + exit 0 Provides: librccl.so.1()(64bit) rccl = 6.4.1-3.fc43 rccl(x86-64) = 6.4.1-3.fc43 Requires(interp): /sbin/ldconfig /sbin/ldconfig Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires(post): /sbin/ldconfig Requires(postun): /sbin/ldconfig Requires: glibc >= 2.41.9000-15 ld-linux-x86-64.so.2()(64bit) ld-linux-x86-64.so.2(GLIBC_2.3)(64bit) libamdhip64.so.6()(64bit) libamdhip64.so.6(hip_4.2)(64bit) libamdhip64.so.6(hip_4.3)(64bit) libamdhip64.so.6(hip_4.5)(64bit) libamdhip64.so.6(hip_5.0)(64bit) libamdhip64.so.6(hip_5.3)(64bit) libamdhip64.so.6(hip_6.0)(64bit) libc.so.6()(64bit) libc.so.6(GLIBC_2.10)(64bit) libc.so.6(GLIBC_2.14)(64bit) libc.so.6(GLIBC_2.16)(64bit) libc.so.6(GLIBC_2.17)(64bit) libc.so.6(GLIBC_2.2.5)(64bit) libc.so.6(GLIBC_2.3)(64bit) libc.so.6(GLIBC_2.3.2)(64bit) libc.so.6(GLIBC_2.3.4)(64bit) libc.so.6(GLIBC_2.32)(64bit) libc.so.6(GLIBC_2.33)(64bit) libc.so.6(GLIBC_2.34)(64bit) libc.so.6(GLIBC_2.38)(64bit) libc.so.6(GLIBC_2.4)(64bit) libc.so.6(GLIBC_2.42)(64bit) libc.so.6(GLIBC_2.6)(64bit) libc.so.6(GLIBC_2.7)(64bit) libc.so.6(GLIBC_ABI_DT_RELR)(64bit) libgcc_s.so.1()(64bit) libgcc_s.so.1(GCC_12.0.0)(64bit) libgcc_s.so.1(GCC_3.0)(64bit) libm.so.6()(64bit) libm.so.6(GLIBC_2.2.5)(64bit) librocm_smi64.so.1()(64bit) libstdc++.so.6()(64bit) libstdc++.so.6(CXXABI_1.3)(64bit) libstdc++.so.6(CXXABI_1.3.7)(64bit) libstdc++.so.6(CXXABI_1.3.9)(64bit) libstdc++.so.6(GLIBCXX_3.4)(64bit) libstdc++.so.6(GLIBCXX_3.4.11)(64bit) libstdc++.so.6(GLIBCXX_3.4.18)(64bit) libstdc++.so.6(GLIBCXX_3.4.19)(64bit) libstdc++.so.6(GLIBCXX_3.4.21)(64bit) libstdc++.so.6(GLIBCXX_3.4.22)(64bit) libstdc++.so.6(GLIBCXX_3.4.26)(64bit) libstdc++.so.6(GLIBCXX_3.4.29)(64bit) libstdc++.so.6(GLIBCXX_3.4.30)(64bit) libstdc++.so.6(GLIBCXX_3.4.32)(64bit) libstdc++.so.6(GLIBCXX_3.4.9)(64bit) Processing files: rccl-devel-6.4.1-3.fc43.x86_64 Executing(%doc): /bin/sh -e /var/tmp/rpm-tmp.IR62l5 + umask 022 + cd /builddir/build/BUILD/rccl-6.4.1-build + cd rccl-rocm-6.4.1 + DOCDIR=/builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/doc/rccl-devel + export LC_ALL=C.UTF-8 + LC_ALL=C.UTF-8 + export DOCDIR + /usr/bin/mkdir -p /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/doc/rccl-devel + cp -pr /builddir/build/BUILD/rccl-6.4.1-build/rccl-rocm-6.4.1/README.md /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT/usr/share/doc/rccl-devel + RPM_EC=0 ++ jobs -p + exit 0 Provides: cmake(rccl) = 2.22.3 rccl-devel = 6.4.1-3.fc43 rccl-devel(x86-64) = 6.4.1-3.fc43 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Requires: cmake-filesystem(x86-64) librccl.so.1()(64bit) Processing files: rccl-data-6.4.1-3.fc43.noarch Provides: rccl-data = 6.4.1-3.fc43 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debugsource-6.4.1-3.fc43.x86_64 Provides: rccl-debugsource = 6.4.1-3.fc43 rccl-debugsource(x86-64) = 6.4.1-3.fc43 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Processing files: rccl-debuginfo-6.4.1-3.fc43.x86_64 Provides: debuginfo(build-id) = 442d17da5c3cb98fea8f3721b064fbe57930e2eb librccl.so.1.0-6.4.1-3.fc43.x86_64.debug()(64bit) rccl-debuginfo = 6.4.1-3.fc43 rccl-debuginfo(x86-64) = 6.4.1-3.fc43 Requires(rpmlib): rpmlib(CompressedFileNames) <= 3.0.4-1 rpmlib(FileDigests) <= 4.6.0-1 rpmlib(PayloadFilesHavePrefix) <= 4.0-1 Recommends: rccl-debugsource(x86-64) = 6.4.1-3.fc43 Checking for unpackaged file(s): /usr/lib/rpm/check-files /builddir/build/BUILD/rccl-6.4.1-build/BUILDROOT Wrote: /builddir/build/RPMS/rccl-data-6.4.1-3.fc43.noarch.rpm Wrote: /builddir/build/RPMS/rccl-debuginfo-6.4.1-3.fc43.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-debugsource-6.4.1-3.fc43.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-devel-6.4.1-3.fc43.x86_64.rpm Wrote: /builddir/build/RPMS/rccl-6.4.1-3.fc43.x86_64.rpm Executing(rmbuild): /bin/sh -e /var/tmp/rpm-tmp.z071RV + umask 022 + cd /builddir/build/BUILD/rccl-6.4.1-build + test -d /builddir/build/BUILD/rccl-6.4.1-build + /usr/bin/chmod -Rf a+rX,u+w,g-w,o-w /builddir/build/BUILD/rccl-6.4.1-build + rm -rf /builddir/build/BUILD/rccl-6.4.1-build + RPM_EC=0 ++ jobs -p + exit 0 Finish: rpmbuild rccl-6.4.1-3.fc43.src.rpm Finish: build phase for rccl-6.4.1-3.fc43.src.rpm INFO: chroot_scan: 1 files copied to /var/lib/copr-rpmbuild/results/chroot_scan INFO: /var/lib/mock/fedora-rawhide-x86_64-1750253281.853958/root/var/log/dnf5.log INFO: chroot_scan: creating tarball /var/lib/copr-rpmbuild/results/chroot_scan.tar.gz /bin/tar: Removing leading `/' from member names INFO: Done(/var/lib/copr-rpmbuild/results/rccl-6.4.1-3.fc43.src.rpm) Config(child) 201 minutes 47 seconds INFO: Results and/or logs in: /var/lib/copr-rpmbuild/results INFO: Cleaning up build root ('cleanup_on_success=True') Start: clean chroot INFO: unmounting tmpfs. Finish: clean chroot Finish: run Running RPMResults tool Package info: { "packages": [ { "name": "rccl", "epoch": null, "version": "6.4.1", "release": "3.fc43", "arch": "x86_64" }, { "name": "rccl-devel", "epoch": null, "version": "6.4.1", "release": "3.fc43", "arch": "x86_64" }, { "name": "rccl", "epoch": null, "version": "6.4.1", "release": "3.fc43", "arch": "src" }, { "name": "rccl-debuginfo", "epoch": null, "version": "6.4.1", "release": "3.fc43", "arch": "x86_64" }, { "name": "rccl-data", "epoch": null, "version": "6.4.1", "release": "3.fc43", "arch": "noarch" }, { "name": "rccl-debugsource", "epoch": null, "version": "6.4.1", "release": "3.fc43", "arch": "x86_64" } ] } RPMResults finished